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A biosynthetic method for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA) is disclosed. Such a method includes 
fermentation of a genetically modified microorganism or plant to produce L-ascorbic acid. In particular, the present invention relates 
to the use of microorganisms and plants having at least one genetic modification to increase the action of an enzyme involved in 
the ascorbic acid biosynthetic pathway. Included is the use of nucleotide sequences encoding epimerases, including the endogenous 
GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway and homologues thereof for the purposes of improving 
the biosynthetic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of 
microalgae, bacteria and yeast useful for producing L-ascorbic acid, and to genetically modified plants, useful for producing consumable 
plant food products. 
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VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS 



FIELD OF THE INVENTION 
The present invention relates to vitamin C (L-ascorbic acid) production using 
genetically modified microorganisms and plants. In particular, the present invention 
5 relates to the use of nucleotide sugar epimerase enzymes for the biological production of 
ascorbic acid in plants and microorganisms. 

BACKGROUND OF THE INVENTION 
Nearly all forms of life, both plant and animal, either synthesize ascorbic acid 
(vitamin C) or require it as a nutrient. Ascorbic acid was first identified to be useful as a 

10 dietary supplement for humans and animals for the prevention of scurvy. Ascorbic acid, 
however, also affects human physiological functions such as the adsorption of iron, cold 
tolerance, the maintenance of the adrenal cortex, wound healing, the synthesis of 
polysaccharides and collagen, the formation of cartilage, dentine, bone and teeth, the 
maintenance of capillaries, and is useful as an antioxidant. 

15 For use as a dietary supplement, ascorbic acid can be isolated from natural 

sources, such as rosehips, synthesized chemically through the oxidation of L-sorbose, or 
produced by the oxidative fermentation of calcium D-gluconate by Acetohacter 
suboxidans. Considine, "Ascorbic Acid," Van Nostrand's Scientific Encyclopedia, Vol. 
1, pp. 237-238, (1989). Ascorbic acid (predominantly intracellular) has also been 

20 obtained through the fermentation of strains of the microalga, Chlorella pyrenoidosa. See 
U.S. Patent No. 5,001,059 by Sk&rud, which is assigned to the assignee of the present 
application. It is believed that ascorbic acid is produced inside the chloroplasts of 
photosynthetic microorganisms and functions to neutralize energetic electrons produced 
during photosynthesis. Accordingly, ascorbic acid production is known in photosynthetic 

25 organisms as a protective mechanism. 

Therefore, products and processes which improve the ability to biosynthetically 
produce ascorbic acid are desirable and beneficial for the improvement of human health. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention relates to a method for producing 
30 ascorbic acid or esters thereof in a microorganism. The method includes the steps of: (a) 
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culturing a microorganism having a genetic modification to increase the action of an 
enzyme selected from the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
5 galactose- 1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone 
dehydrogenase; and (b) recovering the ascorbic acid or esters produced by the 
microorganism. Preferably, the genetic modification is a genetic modification to increase 
the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose 
epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose 
10 dehydrogenase, and/or L-galactono-y -lactone dehydrogenase. In one embodiment of the 
method of the present invention, the microorganism further includes a genetic 
modification to decrease the action of an enzyme having GDP-D-mannose as a substrate, 
other than GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification 
can include, for example, a genetic modification to decrease the action of GDP-D- 
1 5 mannose-dehydrogenase. 

In one embodiment, the genetic modification is a genetic modification to increase 
the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, which can include GDP-D-mannose:GPP-L-galactose epimerase. In one 
embodiment, the epimerase binds NADPH. In one embodiment of this method, the 
20 genetic modification includes transformation of the microorganism with a recombinant 
nucleic acid molecule that expresses the epimerase. Such an epimerase can have a tertiary 
structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code Ibws. Preferably, the epimerase has a structure 
25 having an average root mean square deviation of less than about 2.5 A, and more 

o 

preferably less than about 1 A, over at least about 25% of Ca positions of the tertiary 
structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. 

In one embodiment, the epimerase comprises a substrate binding site having a 
30 tertiary structure that substantially conforms to the tertiary structure of the substrate 
binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
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atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. Such 
a substrate binding site preferably has a tertiary structure with an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Ca positions of the 
tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose 
5 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code Ibws. 

In another embodiment, the epimerase comprises a catalytic site having a tertiary 
structure that substantially conforms to the tertiary structure of the catalytic site of a 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates 

10 having Brookhaven Protein Data Bank Accession Code Ibws. Such a catalytic site 
preferably has a tertiary structure with an average root mean square deviation of less than 
about 1 A over at least about 25% of Ca positions of the tertiary structure of a catalytic 
site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
coordinates having Brookhaven Protein Data Bank Accession Code Ibws. The catalytic 

15 site preferably includes the amino acid residues serine, tyrosine and lysine and in one 
embodiment, the tertiary structure positions of the amino acid residues serine, tyrosine and 
lysine substantially conform to tertiary structure positions of residues Serl07, Tyrl36 and 
Lysl40, respectively, as represented by atomic coordinates in Brookhaven Protein Data 
Bank Accession Code Ibws. 

20 In yet another embodiment of this method, the epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in the amino acid sequence align with 100% identity with at least 
about 50%, and in another embodiment with at least about 75%, and in yet another 
embodiment with at least about 90% of non-Xaa residues in SEQ ID NO: 11. In another 

25 embodiment, the epimerase comprises an amino acid sequence having at least 4 
contiguous amino acid residues that are 100% identical to at least 4 contiguous amino acid 
residues of an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO: 10. In yet another embodiment, 
the recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at 

30 least about 12 contiguous nucleotides having 100% identity with at least about 12 
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contiguous nucleotides of a nucleic acid sequence selected from the group of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO.5, SEQ ID NO:7 and SEQ ID NO:9. 

In yet another embodiment of this method of the present invention, the epimerase 
comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly. In 
5 yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid 
sequence that is at least about 15% identical, and in another embodiment, at least about 
20% identical, and in another embodiment, at least about 25% identical, to a nucleic acid 
sequence selected from the group of SEQ ID NO: 1, SEQIDNO:3, SEQE>NO:5, SEQ 
ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman- 
10 Pearson standard default parameters. 

In yet another embodiment of this method of the present invention, the 
recombinant nucleic acid molecule comprises a nucleic acid sequence that hybridizes under 
stringent hybridization conditions to a nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase. The nucleic acid sequence encoding the GDP-4- 
15 keto-6-deoxy-D-mannose epimerase/reductase includes nucleic acid sequences selected 
from the group of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, and the GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase can include an amino acid sequence selected 
Aonvthegjoi^^ 

In one embodiment of the method of the present invention, the microorganism is 
20 selected from the group of bacteria, fungi and microalgae. In one embodiment, the 
microorganism is acid-tolerant. Preferred bacteria include, but are not limited to 
Azotobacter and Pseudomonas. Preferred fungi include, but are not limited to, yeast, 
including, but not limited to Saccharomyces yeast. Preferred microalgae include, but are 
not limited to, microalgae of the genera Prototheca and Chlorella, with microalgae of the 
25 genus Prototheca being particularly preferred. 

In yet another embodiment of the method of the present invention, the 
microorganism is acid-tolerant and the step of culturing is conducted at a pH of less than 
about 6.0, and more preferably, at a pH of less than about 5.5, and even more preferably, 
at a pH of less than about 5.0. The step of culturing can be conducted in a fermentation 
30 medium that comprises a carbon source other than D-mannose in one embodiment, and 
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in another embodiment, the step of culturing is conducted in a fermentation medium that 
comprises glucose as a carbon source. 

In yet another embodiment of the present method, the step of culturing is 
conducted in a fermentation medium that is magnesium (Mg) limited. Preferably, the step 
5 of culturing is conducted in a fermentation medium that is Mg limited during a cell growth 
phase. In one embodiment, the fermentation medium includes less than about 0.5 g/L of 
Mg during a cell growth phase, and more preferably, less than about 0.2 g/L of Mg during 
a cell growth phase, and even more preferably, less than about 0. 1 g/L of Mg during a cell 
growth phase. 

10 Another embodiment of the present invention relates to a microorganism for 

producing ascorbic acid or esters thereof. The microorganism has a genetic modification 
to increase the action of an enzyme selected from the group of hexokinase, glucose 
phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D- 
mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L- 

15 galactose phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, 
and/or L-galactono-y -lactone dehydrogenase. Preferably, the genetic modification is a 
genetic modification to increase the action of an enzyme selected from the group of GDP- 
D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l- 
P-phosphatase, L-galactose dehydrogenase, and/or I^galactono-y -lactone dehydrogenase, 

20 and even more preferably, to increase the action of GDP-D-mannose:GDP-L-galactose 
epimerase. 

In one embodiment, the microorganism has been genetically modified to express 
a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure 

25 having an average root mean square deviation of less than about 2.5 A over at least about 
25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code lbws. In another embodiment, the microorganism has been 
genetically modified to express a recombinant nucleic acid molecule encoding an 

30 epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the 
epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 11 using a 
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CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence 
align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 11. 
Preferred microorganisms are disclosed as for the method discussed above. 

Yet another embodiment of the present invention relates to a plant for producing 
5 ascorbic acid or esters thereof. Such a plant has a genetic modification to increase the 
action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose-l-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone 
10 dehydrogenase. In a preferred embodiment, the genetic modification is a genetic 
modification to increase the action of an enzyme selected from the group of GDP-D- 
mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone dehydrogenase, 
and in a more preferred embodiment, the genetic modification is a genetic modification 
15 to increase the action of GDP-D-mannose:GDP-L-galactose epimerase. 

In one embodiment, the plant further comprises a genetic modification to decrease 
the action of an enzyme having GDP-D-mannose as a substrate other than GDP-D- 
magnosgGDPi-g^ 

modification to decrease the action of GDP-D-mannose-dehydrogenase. Such a plant also 
20 includes a plant that has been genetically modified to express a recombinant nucleic acid 
molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein the epimerase has a tertiary structure having an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Ca positions of the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
25 by atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. In 
another embodiment, such a plant has been genetically modified to express a recombinant 
nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D- 
mannose to GDP-L-galactose, wherein the epimerase comprises an amino acid sequence 
that aligns with SEQ ID NO:l 1 using a CLUSTAL alignment program, wherein amino 
30 acid residues in the amino acid sequence align with 1 00% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO: 1 1 . 
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In one embodiment, a plant for producing ascorbic acid or esters thereof according 
to the present invention is a microalga. Preferred microalgae include, but are not limited 
to microalgae of the genera Prototheca and Chlorella, with microalga of the genus 
Protoiheca being particularly preferred. In another embodiment, the plant is a higher 
5 plant, with consumable higher plants being more preferred. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 A is a schematic drawing of the pathway from glucose to GDP-D-mannose 
in plants. 

Fig. IB is a schematic drawing of the pathway from GDP-D-mannose to L- 
1 0 galactose- 1 -phosphate in plants. 

Fig. 1C is a schematic drawing of the pathway from L-galactose to L-ascorbic acid 
in plants. 

Fig. 2A is a schematic drawing of selected carbon flow from glucose in 
Prototheca. 

15 Fig. 2B is a schematic drawing of selected carbon flow from glucose in 

Prototheca. 

Fig. 3 is a schematic drawing that shows the lineage of mutants derived from 
Prototheca moriformis ATCC 75669, and their ability to produce L-ascorbic acid. 

Fig. 4 is a bar graph illustrating the conversion of substrates by resting cells of 
20 strain NA45-3 following growth in media containing various magnesium concentrations 
and resuspension in media containing various magnesium concentrations. 

Fig. 5 is a line graph showing the relationship between specific ascorbic acid 
formation in cultures of Prototheca strains and the specific activity of GDP-D- 
mannose: GDP-L-galactose epimerase in extracts prepared from cells harvested from the 
25 same cultures. 

Fig. 6 is a line graph showing the relationship between specific epimerase activity 
and the degree of magnesium limitation in two strains, ATCC 75669 and EMS13-4. 

Fig. 7 depicts the overall catalytic mechanism of GDP-D-mannose: GDP-L- 
galactose epimerase proposed by Barber (1979, J. Biol Chem. 254:7600-7603). 
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f 

Fig. 8A depicts the catalytic mechanism of GDP-D-mannose-4,6-dehydratase 
(converts GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose). 

Fig. 8B depicts the catalytic mechanism of GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (converts GDP-4-keto-6-deoxy-D-mannose to GDP-L-fiicose) 
5 (Chang, et ah, 1988, 1 BioL Chent 263:1693-1697; Barber, 1980, Plant Physiol 66:326- 
329). 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to a biosynthetic method and production 
microorganisms and plants for producing vitamin C (ascorbic acid, L-ascorbic acid, or 
10 AA). Such a method includes fermentation of a genetically modified microorganism to 
produce L-ascorbic acid. In particular, the present invention relates to the use of 
nucleotide sequences encoding epimerases, including the endogenous GDP-D- 
mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway, as well as 
epimerases having structural homology (e.g., by nucleotide/amino acid sequence and/or 
1 5 tertiary structure of the encoded protein) to GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases, or UDP-galactose 4-epimerases, for the purposes of improving the 
)ios\Tithelie^ prLQ^ 

modified microorganisms, such as strains of microalgae, bacteria and yeast useful for 
producing L-ascorbic acid, and to genetically modified plants, useful for producing 

20 consumable plant food products. 

One embodiment of the present invention relates to a method to produce L- 
ascorbic acid by fermentation of a genetically modified microorganism. This method 
includes the steps of (a) culturing in a fermentation medium a microorganism having a 
genetic modification to increase the action of an enzyme selected from the group of 

25 hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-manno se : GDP-L- 
galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P-phosphatase, L- 
galactose dehydrogenase, and L-galactono-y -lactone dehydrogenase; and (b) recovering 
L-ascorbic acid or esters thereof. The various enzymes in this list represent the enzymes 

30 involved in the vitamin C biosynthetic pathway in plants. It is uncertain at this time 
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whether the enzyme represented by GDP-L-galactose phosphorylase is actually a 
phosphorylase or a pyrophosphorylase (i.e., GDP-L-galactose pyrophosphorylase). 
Therefore, use of the term "GDP-L-galactose phosphorylase" herein refers to either GDP- 
L-galactose phosphorylase or GDP-L-galactose pyrophosphorylase. In one aspect of the 
5 invention, this method includes the step of culturing in a fermentation medium a 
microorganism having a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. This aspect of the present 
invention is discussed in detail below. 

Another embodiment of the present invention relates to a genetically modified 

10 microorganism for producing L-ascorbic acid or esters thereof. Another embodiment of 
the present invention relates to a genetically modified plant for producing L-ascorbic acid 
or esters thereof. Both genetically modified microorganisms (e.g., bacteria, yeast, 
microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to 
increase the action of an enzyme selected from the group of hexokinase, glucose 

15 phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-mannose 
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, and/or L- 
galactono-y -lactone dehydrogenase. In a preferred embodiment, both genetically 
modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants, 

20 microalgae) have a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the 
genetic modification includes the transformation of the microorganism or plant with the 
epimerase as described above. 

To produce significantly high yields of L-ascorbic acid by the method of the 

25 present invention, a plant and/or microorganism is genetically modified to enhance 
production of L-ascorbic acid. As used herein, a genetically modified plant (such as a 
higher plant or microalgae) or microorganism, such as a microalga {Prototheca, 
Chlorella), Escherichia coli y or a yeast, is modified (i.e., mutated or changed) within its 
genome and/or by recombinant technology (i.e., genetic engineering) from its normal (i.e., 

30 wild-type or naturally occurring) form. In a preferred embodiment, a genetically modified 
plant or microorganism according to the present invention has been modified by 
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recombinant technology. Genetic modification of a plant or microorganism can be 
accomplished using classical strain development and/or molecular genetic techniques, 
include genetic engineering techniques. Such techniques are generally disclosed herein 
and are additionally disclosed, for example, in Sambrook et al., 1989, Molecular Cloning: 
5 A Laboratory Manual, Cold Spring Harbor Labs Press; Roessler, 1995, Plant Lipid 
Metabolism, pp. 46-48; and Roessler et al., 1994, in Byconversion for Fuels, Kmmel et 
al. eds., American Chemical Society, Washington D.C., pp 255-70). These references are 
incorporated by reference herein in their entirety. 

In some embodiments, a genetically modified plant or microorganism can include 
10 a natural genetic variant as well as a plant or microorganism in which nucleic acid 
molecules have been inserted, deleted or modified, including by mutation of endogenous 
genes (e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such 

■ 

a manner that the modifications provide the desired effect within the plant r 
microorganism, As discussed above, a genetically modified plant or microorganism 
15 includes a plant or microorganism that has been modified using recombinant technology. 

As used herein, genetic modifications which result in a decrease in gene 
expression, an increase in inhibition of gene expression or inhibition of a gene product 

decrease in the function of the gene product can be referred to as inactivation (complete 
20 or partial), deletion, interruption, blockage, down-regulation, or decreased action of a 
gene. For example, a genetic modification in a gene which results in a decrease in the 
function of the protein encoded by such gene can be the result of a complete deletion of 
the gene encoding the protein (i.e., the gene does not exii> s and therefore the protein does 
not exist), a mutation in the gene encoding the protein which results in incomplete or no 
25 translation of the protein (e.g., the protein is not expressed), or a mutation in the gene 
which decreases or abolishes the natural function of the protein (e.g., a protein is 
expressed which has decreased or no enzymatic activity). 

Genetic modifications which result in an increase in gene expression or function 
can be referred to as amplification, overproduction, overexpression, activation, 
30 enhancement, addition, up-regulation or increased action of a gene. Additionally, a genetic 
modification to a gene which modifies the expression, function, or activity of the gene can 
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have an impact on the action of other genes and their expression products within a given 
metabolic pathway (e.g., by inhibition or competition). In this embodiment, the action 
(e.g., activity) of a particular gene and/or its product can be affected (i.e., upregulated or 
downregulated) by a genetic modification to another gene within the same metabolic 
5 pathway, or to a gene within a different metabolic pathway which impacts the pathway of 
interest by competition, inhibition, substrate formation, etc. 

In general, a plant or microorganism having a genetic modification that affects L- 
ascorbic acid production has at least one genetic modification, as discussed above, which 
results in a change in the L-ascorbic acid production pathway as compared to a wild-type 

10 plant or microorganism grown or cultured under the same conditions. Such a 
modification in an L-ascoibic acid production pathway changes the ability of the plant or 
microorganism to produce L-ascorbic acid. According to the present invention, a 
genetically modified plant or microorganism preferably has an enhanced ability to produce 
I^ascoibic acid compared to a wild-type plant or microorganism cultured under the same 

15 conditions. 

The present invention is based on the present inventors' discovery of the 
biosynthetic pathway for I^ascorbic acid (vitamin C) in plants and microorganisms. Prior 
to the present invention, the metabolic pathway by which plants produce L-ascorbic acid, 
was not completely elucidated. The present inventors have demonstrated that L-ascorbic 

20 acid production in plants, including L-ascorbic acid-producing microorganisms (e.g., 
microalgae), is a pathway which uses GDP-D-mannose and involves sugar phosphates and 
NDP-sugars. In addition, the present inventors have made the surprising discovery that 
both L-galactose and L-galactono-Y-lactone can be rapidly converted into L-ascorbic acid 
in L-ascorbic acid-producing microalgae, including Prototheca and Chloretta 

25 pyrenoidosa. The entire pathway for L-ascorbic acid production in plants is set forth in 
Figs. 1A-1C. More particularly, Fig. 1 A shows that the production of L-ascorbic acid in 
plants proceeds through the production of mannose intermediates to GDP-D-mannose, 
followed by the conversion of GDP-D-mannose to GDP-L-galactose by GDP-D- 
mannose:GDP-L-galactose epimerase (also known as GDP-D-mannose-3,5-epimerase) 

30 (Fig. IB), and then by the subsequent progression to L-galactose-l-P, L-galactose, L- 
galactonic acid (optional), L-galactono-Y-lactone, and L-ascorbic acid (Fig. 1C). Fig. IB 
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also illustrates alternate pathways for the use of various intermediates, such as GDP-D- 
mannose, Certain aspects of this pathway have been independently described in a 
publication (Wheeler, et aL, 1998, Nature 393:365-369), incorporated herein by reference 
in its entirety. 

5 Points within the L-ascorbic acid production pathway which can be targeted by 

genetic modification to affect the production of L-ascorbic acid can generally be 
catagorized into at least one of the following pathways: (a) pathways affecting the 
production of GDP-D-mannose (e.g., pathways for converting a carbon source into GDP- 
D-mannose); (b) pathways for converting GDP-D-mannose into other compounds, (c) 

10 pathways associated with or downstream of the action of GDP-D-mannose:GDP-L- 
galactose epimerase, (d) pathways which compete for substrates involved in the 
production of any of the intermediates within the l-ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-l-phosphate, L- 
galactose, L-galactono-y-lactone, and/or L-ascorbic acid; and (e) pathways which inhibit 

15 production of any of the intermediates within the L-ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-l-phosphate, L- 
galactose, L-galactono-y-lactone, and/or L-ascorbic acid. 

invention typically has at least one genetic modification in the L-ascorbic acid production 
20 pathway which resuks in an enhanced production of L-ascorbic acid. In one embodiment, 
a genetically modified plant or microorganism has at least one genetic modification that 
results in: (a) an enhanced production of GDP-D-mannose; (b) an inhibition of pathways 
which convert GDP-D-mannose into compounds other than GDP-L-galactose; (c) an 
enhancement of action of the GDP-D-mannose:GDP-L-galactose epimerase; (d) an 
25 enhancement of the action of enzymes downstream of the GDP-D-mannose:GDP-L- 
galactose epimerase; (e) an inhibition of pathways which compete for substrates involved 
in the production of any of the intermediates within the L-ascorbic acid production 
pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-l- 
phosphate, L-galactose, L-galactono-y-lactone, and/or L-ascorbic acid; and (e) an 
30 inhibition of pathways which inhibit production of any of the intermediates within the L- 
ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L- 
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galactose, L-galactose-1 -phosphate, L-galactose, L-galactono-y-lactone, and/or L- 
ascorbic acid. 

An enhanced production of GDP-D-mannose by genetic modification of the plant 
or microorganism can be achieved by, for example, overexpression of enzymes such as 
5 hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI), 
phosphomannomutase (PMM) and/or GDP-D-mannose pyrophosphoiylase (GMP). 
Inhibition of pathways which convert GDP-D-mannose to compounds other than GDP-L- 
galactose can be achieved, for example, by modifications which inhibit polysaccharide 
synthesis, GDP-D-rhamnose synthesis, GDP-L-fucose synthesis and/or GDP-D- 

10 mannuronic acid synthesis. An increase in the action of the GDP-D-mannose:GDP-L- 
galactose epimerase and of enzymes downstream of the epimerase in the L-ascorbic acid 
production pathway can be achieved by genetic modifications which include, but are not 
limited to: overexpression of the epimerase gene (i.e, by overexpression of a recombinant 
nucleic acid molecule encoding the epimerase gene or a homologue thereof (discussed in 

15 detail below), and/or by mutation of the endogenous or recombinant gene to enhance 
expression of the gene) and/or overexpression of genes downstream of the epimerase 
which encode subsequent enzymes in the L-ascorbic acid pathway. Finally, metabolic 
pathways which compete with or inhibit the L-ascorbic acid production pathway can be 
inhibited by deleting or mutating enzymes, substrates or products which either inhibit or 

20 compete for an enzyme, substrate or product in the L-ascorbic acid pathway. 

As discussed above, a genetically modified plant or microorganism useful in the 
method of the present invention can hive at least one genetic modification (e.g., mutation 
in the endogenous gene or addition of a recombinant gene) in a gene encoding an enzyme 
involved in the L-ascorbic acid production pathway. Such genetic modifications 

25 preferably increase (i.e., enhance) the action of such enzymes such that L-ascorbic acid 
is preferentially produced as compared to other possible end products in related metabolic 
pathways. Such genetic modifications include, but are not limited to, overexpression of 
the gene encoding such enzyme, and deletion, mutation, or downregulation of genes 
encoding competitors or inhibitors of such enzyme. Preferred enzymes for which the 

30 action of the gene encoding such enzyme can be genetically modified include: hexokinase, 
glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase 
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(PMM), GDP-D-mannose pyrophosphorylase (GMP), GDP-I>mannose:GDP-L-galactose 
epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose 
dehydrogenase, and/or L-galactono-Y-lactone dehydrogenase. More preferably, a 
genetically modified plant or microorganism useful in the present invention has a genetic 
5 modification which increases the action of an enzyme selected from the group of GDP-D- 
mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone dehydrogenase. 
Even more preferably, a genetically modified plant or microorganism useful in the present 
invention has a genetic modification which increases the action of GDP-D-mannose: GDP- 

10 L-galactose epimerase. These enzymes and the reactions catalyzed by such enzymes are 
illustrated in Figs. 1A-1C. 

Prior to the present invention, without knowing the L-ascorbic acid biosynthetic 
(i.e., production) pathway, previous mutagenesis and screening efforts were limited in that 
only non-lethal mutations could be detected. One embodiment of the present invention 

15 relates to elimination of a key competing enzyme that diverts carbon flow from L-ascorbic 
acid synthesis. If such enzyme is absolutely required for growth on glucose, then mutants 
lacking the enzyme (and, therefore, having increased carbon flow to L-ascorbic acid) 

One such enzyme is phosphofructokinase (PFK) (See Fig. 2A). PFK is required for 
20 growth on glucose, and is the major step drawing carbon away from L-ascorbic acid 
biosynthesis (Fig. 2A). Elimination of PFK would render the cells nonviable on glucose- 
based media. Selection of a conditional mutant where PFK was inactivated by temperature 
shift, for example, may allow development of a L-ascorbic acid process where cell growth 
is achieved under permissive fermentation conditions, and L-ascorbic acid production 
25 (from glucose) is initiated by a shift to non-permissive condition. In this example, the 
temperature shift would eliminate carbon flow from glucose to glycolysis via PFK, thereby 
shunting carbon into the L-ascorbic acid branch of metabolism. This approach has 
application not only in natural L-ascorbic acid producing organisms, but also in L-ascorbic 
acid recombinant systems (genetically engineered plant or microorganisms) as discussed 
30 herein. 
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Knowing the identity and mechanism of the rate-limiting pathway enzymes in the 
L-ascorbic acid production pathway allows for design of specific inhibitors of the enzymes 
that are also growth inhibitory. Selection of mutants resistant to the inhibitors allows for 
the isolation of strains that contain L-ascorbic acid-pathway enzymes with more favorable 
5 kinetic properties. Therefore, one embodiment of the present invention is to identify 
inhibitors of the enzymes that are also growth inhibitory. These inhibitors are then used 
to select genetic mutants that overcome this inhibition and produce L-ascorbic acid at high 
levels. In this embodiment, the resultant plant or microorganism is a non-recombinant 
strain which can then be further modified by recombinant technology, if desired. In 

10 recombinant L-ascorbic acid producing strains, random mutagenesis and screening can be 
used as a final step to increase L-ascorbic acid production. 

In yet another embodiment genetic modifications are made to an L-ascorbic acid 
producing organism directly. This allows one to build upon a base of data acquired during 
prior classical strain improvement efforts, and perhaps more importantly, allows one to 

15 take advantage of undefined beneficial mutations that occurred during classical strain 
improvement. Furthermore, fewer problems are encountered when expressing native, 
rather than heterologous, genes. The most advanced system for development of genetic 
systems for microalgae has been developed for Chlamydomonas reinhardtii. Preferably, 
development of such a genetically modified production organism would include: isolation 

20 of mutant(s) with a specific nutritional requirement for use with a cloned selectable marker 
gene (similar to the ura3 mutants used in yeast and fungal systems); a cloned selectable 
marker such as URA 3 or alternatively, identification and cloning of a gene that specifies 
resistance to a toxic compound (this would be analogous to the use of antibiotic resistance 
genes in bacterial systems, and, as is the case in yeast and other fungi, a means of 

25 inserting/removing the marker gene repeatedly would be required, unless several different 
selectable markers were developed); a transformation system for introducing DNA into 
the production organism and achieving stable transformation and expression; and, a 
promoter system (preferably several) for high-level expression of cloned genes in the 
organism. 

30 Another embodiment of the present invention, discussed in detail below, is to place 

key genes or allelic variants and homologues thereof from L-ascorbic acid producing 
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organisms (i.e., higher plants and microalgae) into a plant or microorganism that is more 
amenable to molecular genetic manipulation, including endogenous L-ascorbic acid 
producing microorganisms and suitable plants. For example, it is possible to identify a 
suitable non-pathogenic organism based on the requirement of growth (on glucose) at low 
5 pH (i.e., acid-tolerant organisms, discussed in detail below). 

One suitable candidate for recombinant production in any suitable host organism 
is the gene (nucleic acid molecule) encoding GDP-D-mannose:GDP-L-galactose 
epimerase and homologues of the GDP-D-mannose:GDP-L-galactose epimerase, as well 
as any other epimerase that has structural homology at the primary (i.e., sequence) or 

10 tertiary (i.e., three dimensional) level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase, or to a UDP-galactose 4-epimerase. Many microorganisms produce GDP-D- 
mannose as a precursor to exopolysaccharide and glycoprotein production, even though 
such organisms may not make L-ascorbic acid. This aspect of the present invention is 
discussed in detail below. 

15 Referring to Figs. 1 A-1C, at least some of the enzymes from glucose-6-phosphate 

to GDP-D-mannose are present in many organisms. In fact, the entire sequence is present 
in bacteria such as Azotobacter vinelandii and Pseudomonas aeruginosa, and make up the 

possible that the only thing preventing these organisms from producing L-ascorbic acid 
20 could be the lack of GDP-D-mannose:GDP-L-galactose epimerase. The presence of PMI, 
PMM and GMP (see Fig. 1 A) in so many organisms is important for two reasons. First, 
these organisms themselves could serve as alternate hosts for L-ascorbic acid production, 
by building on the existing early pathway enzymes and adding the required cloned genes 
(the epimerase and possibly others). Second, the genes encoding PMI, PMM and GMP 
25 can be cloned into a new organism where, together with the cloned epimerase, they would 
encode the overall pathway from glucose-6-phosphate to GDP-L- galactose. 

In order to screen genomic DNA or cDNA libraries from different organisms and 
to isolate nucleic acid molecules encoding these enzymes such as the GDP-D- 
mannose:GDP-L-galactose epimerase, one can use any of a variety of standard molecular 
30 and biochemical techniques. For example, the GDP-D-mannose:GDP-L-galactose 
epimerase can be purified from an organism such as Prototheca, the N-terminal amino 
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acid sequence can be determined (including, if necessary, the sequence of internal peptide 
fragments), and this information can be used to design degenerate primers for amplifying 
a gene fragment from the organism's DNA. This fragment would then be used to probe 
the library, and subsequently fragments that hybridize to the probe would be cloned in that 
5 organism or another suitable production organism. There is ample precedent for plant 
enzymes being expressed in an active form in bacteria, such as K colt. Alternatively, 
yeast are also a suitable candidate for developing a heterologous system for L-ascorbic 
acid production. 

It is to be understood that the present invention discloses a method comprising the 

10 use of a microorganism with an ability to produce commercially useful amounts of L- 
ascorbic acid in a fermentation process (i.e., preferably an enhanced ability to produce L- 
ascorbic acid compared to a wild-type microorganism cultured under the same 
conditions). This method is achieved by the genetic modification of one or more genes 
encoding a protein involved in an L-ascorbic add pathway which results in the production 

IS (expression) of a protein having an altered (e.g., increased or decreased) function as 
compared to the corresponding wild-type protein. Preferably, such genetic modification 
is achieved by recombinant technology. It will be appreciated by those of skill in the art 
that production of genetically modified plants or microorganisms having a particular 
altered function as described elsewhere herein (e.g., an enhanced ability to produce GDP- 

20 D-mannose:GDP-L-galactose epimerase), such as by transformation of the plant or 
microorganism with a nucleic acid molecule which encodes a particular enzyme, can 
produce many organisms meeting th^ given functional requirement, albeit by virtue of a 
variety of different genetic modifications. For example, different random nucleotide 
deletions and/or substitutions in a given nucleic acid sequence may all give rise to the 

25 same phenotypic result (e.g., decreased enzymatic activity of the protein encoded by the 
sequence). The present invention contemplates any such genetic modification which 
results in the production of a plant or microorganism having the characteristics set forth 
herein. 

A microorganism to be used in the fermentation method of the present invention 
30 is preferably a bacterium, a fungus, or a microalga which has been genetically modified 
according to the disclosure above. More preferably, a microorganism useful in the present 
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invention is a microalga which is capable of producing L-ascorbic acid, although the 
present invention includes microorganisms which are genetically engineered to produce 
L-ascorbic acid using the knowledge of the key components of the pathway and the 
guidance provided herein. Even more preferably, a microorganism useful in the present 
5 invention is an acid-tolerant microorganism, such as microalgae of the genera Protoiheca 
and Chlorella. Acid-tolerant yeast and bacteria are also known in the art. Acid-tolerant 
microorganisms are discussed in detail below. Particularly preferred microalgae include 
microalgae of the genera, Prototheca and Chlorella, with Protoiheca being most 
preferred. All known species of Prototheca produce L-ascorbic acid. Production of 

10 ascorbic acid by microalgae of the genera Prototheca and Chlorella is described in detail 
in U.S. Patent No. 5,792,631, issued August 11, 1998, and in U.S. Patent No. 5,900,370, 
issued May 4, 1999, both of which are incorporated herein by reference in their entirety. 
Preferred bacteria for use in the present invention include, but are not limited to, 
Azotobacter, Pseudomonas, and Escherichia, although acid-tolerant bacteria are more 

15 preferred. Preferred fungi for use in the present invention include yeast, and more 
preferably, yeast of the genus, Saccharomyces. A microorganism for use in the 
fermentation method of the present invention can also be referred to as a production 

as microorganisms or as plants. 
20 A preferred plant to genetically modify according to the present invention is 

preferably a plant suitable for consumption by animals, including humans. More 
preferably, such a plant is a plant that naturally produces L-ascorbic acid, although other 
plants can be genetically modified to produce L-ascorbic acid using the guidance provided 
herein. 

25 The L-ascorbic acid production pathways of the microalgae Prototheca and 

Chlorella pyrenoidosavnUbe addressed as specific embodiments of the present invention 
are described below. It will be appreciated that other plants and, in particular, other 
microorganisms, have similar L-ascorbic acid pathways and genes and proteins having 
similar structure and function within such pathways. It will also be appreciated that plants 

30 and microorganisms which do not naturally produce L-ascorbic acid can be modified 
according to the present invention to produce L-ascorbic acid. As such, the principles 
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discussed below with regard to Prototheca and Chloretta pyrenoidosa are applicable to 
other plants and microorganisms, including genetically modified plants and 
microorganisms. 

In one embodiment of the present invention, the action of an enzyme in the L~ 
5 ascorbic acid production pathway is increased by amplification of the expression (i.e., 
overexpression) of an enzyme in the pathway, and particularly, the GDP-D- 
mannose:GDP-L-galactose epimerase, homologues of the epimerase, and/or enzymes 
downstream of the epimerase. Overexpression of an enzyme can be accomplished, for 
example, by introduction of a recombinant nucleic acid molecule encoding the enzyme. 

10 It is preferred that the gene encoding an enzyme in the L-ascorbic acid production 
pathway be cloned under control of an artificial promoter. The promoter can be any 
suitable promoter that will provide a level of enzyme expression required to maintain a 
sufficient level of L-ascorbic acid in the production organism. Preferred promoters are 
constitutive (rather than inducible) promoters, since the need for addition of expensive 

15 inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic 
acid molecule according to the present invention can be varied according to the 
requirements for maximum product formation. In one embodiment, the recombinant 
nucleic acid molecule encoding a gene in the L-ascorbic acid production pathway is 
integrated into the chromosomes of the microorganism. 

20 It is another embodiment of the present invention to provide a microorganism 

having one or more enzymes in the L-ascorbic acid production pathway with improved 
affinity for its substrates. An enzyme with improved affinity for its substrates can be 
produced by any suitable method of genetic modification or protein engineering. For 
example, computer-based protein engineering can be used to design an epimerase protein 

25 with greater stability and better affinity for its substrate. See for example, Maulik et al., 
1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, 
Inc., which is incorporated herein by reference in its entirety. 

Recombinant nucleic acid molecules encoding proteins in the L-ascorbic acid 
production pathway can be modified to enhance or reduce the function (i.e., activity) of 

30 the protein, as desired to increase L-ascorbic acid production, by any suitable method of 
genetic modification. For example, a recombinant nucleic acid molecule encoding an 
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enzyme can be modified by any method for inserting, deleting, and/or substituting 
nucleotides, such as by error-prone PCR. In this method, the gene is amplified under 
conditions that lead to a high frequency of misincorporation errors by the DNA 
polymerase used for the amplification. As a result, a high frequency of mutations are 
5 obtained in the PCR products. The resulting gene mutants can then be screened for 
enhanced substrate affinity, enhanced enzymatic activity, or reduced/increased inhibitory 
ability by testing the mutant genes for the ability to confer increased L-ascorbic acid 
production onto a test microorganism, as compared to a microorganism carrying the non- 
mutated recombinant nucleic acid molecule. 

10 Another embodiment of the present invention includes a microorganism in which 

competitive side reactions are blocked, including all reactions for which GDP-D-mannose 
is a substrate other than the production of L-ascorbic acid. In a preferred embodiment, 
a microorganism having complete or partial inactivation (decrease in the action of) of 
genes encoding enzymes which compete with the GDP-D-mannose:GDP-L-galactose 

15 epimerase for the GDP-D-mannose substrate is provided. Such enzymes include GDP-D- 
mannase and/or GDP-D-mannose-dehydrogenase. As used herein, inactivation of a gene 
can refer to any modification of a gene which results in a decrease in the activity (i.e., 

deletion of activity. 

20 As discussed above, a particularly preferred aspect of the method to produce L~ 

ascorbic acid by fermentation of a genetically modified microorganism of the present 
invention includes the step of culturing in a fermentation medium a microorganism having 
a genetic modification to increase the action of an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose. According to the present invention, such an 

25 epimerase can include the endogenous GDP-D-mannose : GDP-L-galactose epimerase from 
the L-ascorbic acid pathway, described above, as well as any other epimerase that has 
structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional) 
level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or to a UDP-galactose 
4-epimerase. Such structural homology is discussed in detail below. Preferably, such an 

30 epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L- 
galactose. In one embodiment, the genetic modification includes transformation of the 
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microorganism with a recombinant nucleic acid molecule that expresses such an 
epimerase. 

Therefore, the epimerase encompassed in the method and organisms of the 
present invention includes the endogenous epimerase which operates in the naturally 
5 occurring ascorbic acid biosynthetic pathway (referred to herein as GDP-D- 
mannose:GDP-L-galactose epimerase), GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases, and any other epimerase which is capable of catalyzing the conversion of 
GDP-D mannose to GDP-L-galactose and which is structurally homologous to a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase. An 
10 epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according 
the present invention can be identified by biochemical and functional characteristics as well 
as structural characteristics. For example, an epimerase according to the present invention 

,v 

is capable of acting on GDP-D-mannose as a substrate, and more particularly, such an 
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L- 

15 galactose. It is to be understood that such capabilities need not necessarily be the normal 
or natural function of the epimerase as it acts in its endogenous (i.e., natural) environment. 
For example, GDP-4-keto-6-deoxy-D-mannose epimerase/reductase in its natural 
environment under normal conditions, catalyzes the conversion of GDP-D-mannose to 
GDP-L-fiicose and does not act directly on GDP-D-mannose (See Fig.8A, B), however, 

20 such an epimerase is encompassed by the present invention for use in catalyzing the 
conversion of GDP-D-mannose to GDP-L-galactose for production of ascorbic acid, to 
the extent that it is capable o£ or 'can be modified to be capable of, catalyzing the 
conversion of GDP-D-mannose to GDP-L-galactose. Therefore, the present invention 
includes epimerases which have the desired enzyme activity for use in production of 

25 ascorbic acid, are capable of having such desired enzyme activity, and/or are capable of 
being modified or induced to have such desired enzyme activity. 

In one embodiment, an epimerase according to the present invention includes an 
epimerase that catalyzes the reaction depicted in Fig. 7. In another embodiment, an 
epimerase according to the present invention includes an epimerase that catalyzes the first 

30 of the reactions depicted in Fig. 8B. In one embodiment, an epimerase according to the 
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present invention binds to NADPH. In another embodiment, an epimerase according to 
the present invention is NADPH-dependent for enzyme activity. 

As discussed above, the present inventors have discovered that a key enzyme in 
L-ascorbic acid biosynthesis in plants and microorganisms is GDP-D-mannose:GDP-L- 
5 galactose epimerase (refer to Figs. 1 A-1C). One embodiment of the invention described 
herein is directed to the manipulation of this enzyme and structural homologues of this 
enzyme to increase L-ascorbic acid production in genetically engineered plants and/or 
microorganisms. More particularly, the GDP-D-mannose:GDP-L-galactose epimerase of 
the L-ascorbic acid pathway and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases 

10 are believed to be structurally homologous at both the sequence and tertiary structure 
level; a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is believed to be capable 
of functioning in the L-ascorbic add biosynthetic pathway; and a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase or homologue thereof may be superior to a GDP-D- 
mannose-GDP-L-galactose epimerase for increasing L-ascorbic acid production in 

15 genetically engineered plants and/or microorganisms. Furthermore, the present inventors 
disclose the use of a nucleotide sequence encoding all or part of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase as a probe to identify the gene encoding GDP-D- 




of a nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose 
20 epimerase/reductase to design oligonucleotide primers for use in a PCR-based strategy for 

identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose epimerase. 
Without being bound by theory, the present inventors believe that the following 

evidence supports the novel concept that the GDP-D-mannose:GDP-L-galactose 

epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases have significant 
25 structural homology at the level of sequence and/or tertiary structure, and that the GDP-4- 

keto-6-deoxy-D-mannose epimerase/reductases and/or homologues thereof would be 

useful for production of ascorbic acid and/or for isolating the endogenous GDP-D- 

mannose:GDP-L-galactose epimerase. 

Although prior to the present invention, it was not known that the GDP-D- 
30 mannose:GDP-L-galactose epimerase enzyme (also known as GDP-D-mannose-3 , 5- 

epimerase) plays a critical role in L-ascorbic acid biosynthesis, this enzyme was previously 
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described to catalyze the overall reversible reaction between GDP-D-mannose and GDP- 
L-galactose (Barber, 1971, Arch. Biochem. Biophys. 147:619-623; Barber, 1975, Arch. 
Biochem. Biophys. 167:718-722; Barber, 1979, J. Biol. Chem. 254:7600-7603; Hebda, 
et al., 1979, Arch. Biochem. Biophys. 194:496-502; Barber and Hebda, 1982, Meth 
5 Enzymol, 83:522-525). Despite these studies, GDP-D-mannose:GDP~L-galactose 
epimerase has never been well characterized nor has the gene encoding this enzyme been 
cloned and sequenced. Since the original work by Barber, GDP-D-mannose:GDP-L- 
galactose epimerase activity has been detected in the colorless microalga Prototheca 
moriformis by the assignee of the present application, and in Arabidopsis thaliana and pea 

10 embryonic axes (Wheeler, et al., 1998, ibid.). 

Barber (1979, J. Biol. Chem. 254:7600-7603) proposed a mechanism for GDP-D- 
mannose:GDP-L-galactose epimerase partially purified from the green microalga 
Chlorellapyremidosa. The overall conversion of GDP-D-mannose to GDP-L-galactose 
was proposed to proceed by oxidation of the hexosyl moiety at C-4 to a keto intermediate, 

15 ene-diol formation, and inversion of the configurations at C-3 and C-5 upon rehydration 
of the double bonds and stereospecific reduction of the keto group. The proposed 
mechanism is depicted in Fig. 7. 

Based on Barber's work, Feingold and Avigad (1980, In The Biochemistry of 
Plants, Vol. 3: Carbohydrates; Structure and Function, P.K. Stompf and E.E. Conn, eds., 

20 Academic Press, NY) elaborated further on the proposed mechanism for GDP-D- 
mannose:GDP-L-galactose epimerase. This mechanism is based on the assumption that 
the epimerase contains tightly bound NAD + , and transfer of a hydride ion from C-4 of the 
substrate (GDP-D-mannose) to enzyme-associated NAD + converts the enzyme to the 
reduced (NADH)form, generating enzyme-bound GDP-4-keto-D-mannose. The latter 

25 would then undergo epimerization by an ene-diol mechanism. The final product (GDP-L- 
galactose) would be released from the enzyme after stereospecific transfer of the hydride 
ion originally removed from C-4, simultaneously regenerating the oxidized form of the 
enzyme. 

L-fucose (6-deoxy-L-galactose) is a component of bacterial lipopolysaccharides, 
30 mammalian and plant glycoproteins and polysaccharides of plant cell walls. L-fucose is 
synthesized de novo from GDP-D-mannose by the sequential action of GDP-D-mannose- 
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4,6-dehydratase (an NAD(P)-dependent enzyme), and a bifunctional GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase (NADPH-dependent), also referred to in scientific 
literature as GDP-fucose synthetase (Rizzi, et al., 1998, Structure 6:1453-1465; Somers, 
et al., 1998, Structure 6:1601-1612). This pathway for L-fiicose biosynthesis appears to 
5 be ubiquitous (Rizzi, et al., 1998, Structure 6; 1453-1465). The mechanisms for GDP-D- 
mannose-4,6-dehydratase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase are 
shown in Fig. 8A, B (Chang, et al., 1988, J. Biol. Chem. 263:1693-1697; Barber, 1980, 
Plant Physiol 66:326-329). 

Comparison of Figs. 7 and 8 A, B reveals that Barber's proposed mechanism for 
10 GDP-D-mannose:GDP-L-galactose epimerase is analogous to the reaction mechanism for 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The same mechanism has also 
been demonstrated for the epimerization reaction that occurs in the biosynthesis of two 
TDP-6-deoxy hexoses, TDP-L-rhamnose and TDP-6-deoxy-L-talose, from TDP-D- 
glucose (Liu and Thorson, 1994, Ann. Rev. Microbiol 48:223-256). In the latter cases, 
1 5 however, the final reduction at C-4 is catalyzed by NADPH-dependent reductases that are 
separate from the epimerase enzyme. These reductases have opposite stereospecificity, 
providing either TDP-L-rhamnose or TDP-6-deoxy-L-talose (Liu and Thorson, 1994, 

In all of the mechanisms described above, NAD(P)H is required for the final 
reduction at C-4 (refer to Fig. 8B). In the work of Hebda, et al. (1979, Arch Biochem. 
Biophys. 194:496-502), it was reported that GDP-D-mannose:GDP-L-galactose 
epimerase from C. pyrenoidosa did not require NAD, NADP or NADH for activity. 
Strangely, NADPH was not tested. Based on the analogous mechanisms shown in Figs. 
7 and 8A, B, the present inventors believe that it is likely that GDP-D-mannose:GDP-L- 
galactose epimerase from C. pyrenoidosa requires NADPH for the final reduction step. 
Why activity was detected in vitro without NADPH addition is not known, but tight 
"binding of NADPH to the enzyme could explain this observation. On the other hand, if 
the proposed mechanism of Feingold and Avigad (1980, in The Biochemistry of Plants, 
Vol. 3, p. 101-170: Caibohydrates; Structure and Function, P.K. Stompf and E.E. Conn, 
ed., Academic Press, NY) is correct, the reduced enzyme-bound cofactor generated in the 
first oxidation step of the epimerase reaction would serve as the source of electrons for 
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the final reduction of the keto group at C-4 back to the alcohol. Thus no addition of 
exogenous reduced cofactor would be required for activity in vitro. 

Recently, a human gene encoding the bifunctional GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase was cloned and sequenced (Tonetti, et al., 1996, J. BioL 
5 Chem. 271-27274-27279). This amino acid sequence of the human GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase shows significant homology (29% identity) to theE. coli 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (Tonetti, et al., 1998, Acta Cryst. 
D54:684-686; Somers, et al., 1998, Structure 6:1601-1612, both of which are 
incorporated herein by reference in their entireties). Tonetti et al. and Somers et al. 

10 additionally disclosed the tertiary (three dimensional) structure of the E. coli GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase (also known as GDP-fucose synthetase), and 
noted significant structural homology with another epimerase, UDP-galactose 4-epimerase 
(GalE). These epimerases also share significant homology at the sequence level. Since 
no gene encoding a GDP-D-mannose:GDP-L-galactose epimerase has been cloned and 

15 sequenced, homology with genes encoding GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases or with genes encoding a UDP-galactose 4-epimerase has not been 
demonstrated. However, based on the similarity of the reaction products for GDP-D- 
mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase (i.e., GDP-L-galactose and GDP-6-deoxy-L-galactose [i.e., GDP-L-fiicose], 

20 respectively) and the common catalytic mechanisms (Figs. 7 and 8A, B) the present 
inventors believe that the genes encoding the enzymes will have a high degree of sequence 
homology, as well as tertiary structural homology. 

Significant structural homology between GDP-D-mannose: GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases may allow a GDP- 

25 4-keto-6-deoxy-D-mannose epimerase/reductase, or a homologue thereof, to function in 
the L-ascorbic acid biosynthetic pathway, and a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase could potentially be even better than a GDP-D-mannose-GDP-L- 
galactose epimerase for increasing L-ascorbic acid production in genetically engineered 
plants and/or microorganisms. Furthermore, a nucleotide sequence encoding all or part 

30 of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used as a probe to 
identify the gene encoding GDP-D-mannose: GDP-L-galactose epimerase. Likewise, the 
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nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase can be used to design oligonucleotide primers for use in a PCR-based strategy 
for identifying and cloning a gene encoding GDP-D-mannose: GDP-L-galactose 
epimerase. 

5 The ability to substitute GDP-4-keto-6-D-mannose epimerase/reductase for GDP- 

D-mannose:GDP-L-galactose epimerase to enhance L-ascorbic acid biosynthesis in plants 
or microorganisms depends on the ability of GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase to act directly on GDP-D-mannose to form GDP-L-galactose. Evidence 
supporting this possibility already exists. Arabidopsis thaliana murl mutants are defective 

10 in GDP-D-mannose-4,6-dehydratase activity (Bonin, et al., 1997, Proc. Natl. Acad Set 
94:2085-2090). These mutants are thus blocked in GDP-L-fucose biosynthesis, and 
consequently have less than 2% of the normal amounts of L-fucose in the primary cell 
walls of aerial portions of the plant (Zablackis, et al., 1996, Science 272:1808-1810). The 
murl mutants are more brittle than wild-type plants, are slightly dwarfed and have an 

15 apparently normal life cycle (Zablackis, et al., 272:1808-1810). When murl mutants are 
grown in the presence of exogenous L-fucose, the L-fucose content in the plant is restored 
to the wild-type state (Bonin, et al., 1997, Proc. Natl Acad Sci. 94:2085-2090). It was 
-g-gdiscQveig^ 

in the hemicellulose xyloglucan component of the primary cell wall, L-galactose in place 
20 of the normal l-fucose. L-galactose is not normally found in the xyloglucan component, 
but in murl mutants L-galactose partly replaces the terminal L-fucosyl residue. Bonin, et 
al. (1997, Proc. Natl. Acad Sci. 94:2085-2090) hypothesized that in the absence of a 
functional GDP-D-mannose-4,6-dehydratase in the wrl mutants, the GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase normally involved in L-fiicose synthesis may be 
25 able to use GDP-D-mannose directly, forming GDP-L-galactose. Another possibility, 
however, is that the enzymes involved in L-ascorbic acid biosynthesis in A. thaliana are 
responsible for forming GDP-L-galactose in the murl mutant. If this were true, it would 
suggest that in the wild-type plant, some mechanism exists that prevents GDP-L-galactose 
formed in the L-ascorbic acid pathway from entering cell wall biosynthesis and 
30 substituting for (competing with) GDP-L-fiicose for incorporation into the xyloglucan 
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component (since L-galactose is not present in the primary cell wall of the wild-type 
plant). 

Because of the similar reaction mechanisms of GDP-D-mannose:GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, and because of the 
5 evidence that GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can act directly on 
GDP-D-mannose to form GDP-L-galactose, the present inventors believe that genes 
encoding all epimerases and epimerase/reductases that act on GDP-D-mannose have high 
homology. As such, one aspect of the present invention relates to the use of any 
epimerase (and nucleic acid sequences encoding such epimerase) having significant 

10 homology (at the primary, secondary and/or tertiary structure level) to a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase or to a UDP-galactose 4-epimerase for the 
purpose of improving the biosynthetic production of L-ascorbic acid. 

Therefore, as described above, one embodiment of the present invention relates 
to a method for producing ascorbic acid or esters thereof in a microorganism, which 

15 includes culturing a microorganism having a genetic modification to increase the action 
of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Also 
included in the present invention are genetically modified microorganisms and plants in 
which the genetic modification increases the action of an epimerase that catalyzes 
conversion of GDP-D-mannose to GDP-L-galactose. 

20 According to the present invention, an increase in the action of the GDP-D- 

mannose:GDP-L-galactose epimerase in the L-ascorbic acid production pathway can be 
achieved by genetic modifications which include, but are not limited to overexpression of 
the GDP-D-mannose:GDP-L-galactose epimerase gene, a homologue of such gene, or of 
any recombinant nucleic acid sequence encoding an epimerase that is homologous in 

25 primary (nucleic acid or amino acid sequence) or tertiary (three dimensional protein) 
structure to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 
4-epimerase, such as by overexpression of a recombinant nucleic acid molecule encoding 
the epimerase gene or a homologue thereof, and/or by mutation of the endogenous or 
recombinant gene to enhance expression of the gene. 

30 According to the present invention, an epimerase that has a tertiary structure that 

is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
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reductase is an epimerase that has a tertiary structure that substantially conforms to the 
tertiaiy structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws 
(Table 12). In another embodiment, an epimerase that has a tertiary structure that is 
5 homologous to the tertiary structure of a GDP-4-ket o-6-d eoxy-D-mannose epimerase/ 
reductase is an epimerase that has a tertiary structure that substantially conforms to the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS. 
As used herein, a "tertiary structure" or tc three dimensional structure" of a protein, such 

10 terms being interchangeable, refers to the components and the manner of arrangement of 
the components in three dimensional space to constitute the protein. The use of the term 
"substantially conforms" refers to at least a portion of a tertiary structure of an epimerase 
which is sufficiently spatially similar to at least a portion of a specified three dimensional 
configuration of a particular set of atomic coordinates (e.g., those represented by 

15 Brookhaven Protein Data Bank Accession Code Ibws) to allow the tertiary structure of 
at least said portion of the epimerase to be modeled or calculated (i.e., by molecular 
replacement) using the particular set of atomic coordinates as a basis for estimating the 

More particularly, a tertiary structure that substantially conforms to a given set of 
20 atomic coordinates is a structure having an average root-mean-square deviation (RMSD) 
of less than about 2.5 A, and more preferably, less than about 2 A, and, in increasing 
preference, less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most 
preferably, less than about 0.3 A, over at least about 25% of the Ca positions as 
compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
25 reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code Ibws. In other embodiments, a structure that substantially conforms to 
a given set of atomic coordinates is a structure wherein such structure has the recited 
average root-mean-square deviation (RMSD) value over at least about 50% of the Ca 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
30 epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code Ibws, and in another embodiment, such structure has the 
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recited average root-mean-square deviation (RMSD) value over at least about 75% of the 
Ca positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code lbws, and in another embodiment, such structure has the 
5 recited average root-mean-square deviation (RMSD) value over about 100% of the Ca 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code lbws. Methods to calculate RMSD values are well known 
in the art. Various software programs for determining the tertiary structural homology 

10 between one or more proteins are known in the art and are publicly available, such as 
QUANTA (Molecular Simulations Inc.). 

A preferred epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose according to the method and genetically modified organisms of the present 
invention includes an epimerase that comprises a substrate binding site having a tertiary 

15 structure that substantially conforms to the tertiary structure of the substrate binding site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 
coordinates having Brookhaven Protein Data Bank Accession Code lbws. Preferably, the 
tertiary structure of the substrate binding site of the epimerase has an average root-mean- 
square deviation (RMSD) of less than about 2.5 A, and more preferably, less than about 

20 2 A, and, in increasing preference, less than about 1.5 A, less than about 1 A, less than 
about 0.5 A, and most preferably, less than about 0.3 A, over at least about 25% of the 
Ca positions as compared to the tertiary structure of the substrate binding site of a GDP- 
4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates 
having Brookhaven Protein Data Bank Accession Code lbws. In other embodiments, the 

25 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 50% of the Ca positions 
as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws, and in another embodiment, the 

30 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 75% of the Ca positions 
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as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, and in another embodiment, the 
tertiary structure of the substrate binding site of the epimerase has the recited average 
5 root-mean-square deviation (RMSD) value over about 100% of the Ca positions as 
compared to the tertiaiy structure of the substrate binding site of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws. The tertiary structure of the 
substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

10 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary 
structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code 1GFS is discussed in detail in Somers et al., 1998, ibid. 

15 Another preferred epimerase according to the present invention includes an 

epimerase that comprises a catalytic site having a tertiary structure that substantially 
conforms to the tertiaiy structure of the catalytic site of a GDP-4-keto-6-deoxy-D- 

Protein Data Bank Accession Code lbws. Preferably, the tertiary structure of the 
20 catalytic site of the epimerase has an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 
less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A, over at least about 25% of the Ca positions as compared to the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 
25 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws. In other embodiments, the tertiary structure of the catalytic site of the 
epimerase has the recited average root-mean-square deviation (RMSD) value over at least 
about 50% of the Ca positions as compared to the tertiary structure of the catalytic site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 
30 coordinates having Brookhaven Protein Data Bank Accession Code lbws, and in another 
embodiment, the tertiary structure of the catalytic site of the epimerase has the recited 
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average root-mean-square deviation (RMSD) value over at least about 75% of the Ca 
positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, and in another embodiment, the 
5 tertiary structure of the catalytic site of the epimerase has the recited average root-mean- 
square deviation (RMSD) value over 100% of the Ca positions as compared to the 
tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code lbws. 

10 In one embodiment, an epimerase encompassed by the present invention includes 

an epimerase that has a catalytic site which includes amino acid residues: serine, tyrosine 
and lysine. In a preferred embodiment, the tertiary structure positions of the amino acid 
residues serine, tyrosine and lysine substantially conform to the tertiary structure position 
of residues Serl07, Tyrl36 and Lysl40, respectively, as represented by atomic 

15 coordinates in Brookhaven Protein Data Bank Accession Code lbws. The tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws is discussed in detail in Rizzi et al. f 1998, ibid. Additionally, the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

20 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code 1GFS is discussed in detail in Somers et al., 1998, ibid. 

In an even more preferred embodiment, the above definition of "substantially 
conforms" can be further defined to include atoms of amino acid side chains. As used 
herein, the phrase "common amino acid side chains" refers to amino acid side chains that 

25 are common to both the structures which substantially conforms to a given set of atomic 
coordinates and the structure that is actually represented by such atomic coordinates. 
Preferably, a tertiary structure that substantially conforms to a given set of atomic 
coordinates is a structure having an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 

30 less than about 1 .5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A over at least about 25% of the common amino acid side chains as 
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compared to the tertiary structure represented by the given set of atomic coordinates. In 
another embodiment, a structure that substantially conforms to a given set of atomic 
coordinates is a structure having the recited average root-mean-square deviation (RMSD) 
value over at least about 50% of the common amino acid side chains as compared to the 
5 tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such structure has the recited average root-mean-square deviation (RMSD) 
value over at least about 75% of the common amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such a structure has the recited average root-mean-square deviation 

10 (RMSD) value over 100% of the common amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates. 

A tertiary structure of an epimerase which substantially conforms to a specified set 
of atomic coordinates can be modeled by a suitable modeling computer program such as 
MODELER (A, Sali and T.L. Blundell, 1 Mol Biol, vol. 234:779-815, 1993 as 

15 implemented in the Insight II Homology software package (Insight II (97.0), MSI, San 
Diego)), using information, for example, derived from the following data: (1) the amino 
acid sequence of the epimerase; (2) the amino acid sequence of the related portion(s) of 

dimensional configuration; and, (3) the atomic coordinates of the specified three 
20 dimensional configuration. Alternatively, a tertiary structure of an epimerase which 
substantially conforms to a specified set of atomic coordinates can be modeled using data 
generated from analysis of a crystallized structure of the epimerase. A tertiary structure 
of an epimerase which substantially conforms to a specified set of atomic coordinates can 
also be calculated by a method such as molecular replacement. Methods of molecular 
25 replacement are generally known by those of skill in the art (generally described in 
Brunger, Meth Enzym., vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Meth 
Enzym., vol. 276, pp. 581-594, 1997; Tong and Rossmann, Meth Enzym., vol. 276, pp. 
594-611, 1997; and Bentley, Meth. Enzym., vol. 276, pp. 611-619, 1997, each of which 
are incorporated by this reference herein in their entirety) and are performed in a software 
30 program including, for example, XPLOR (Brunger, et al., Science, vol. 235, p. 458, 
1987). In addition, a structure can be modeled using techniques generally described by, 
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for example, Sali, Current Opinions in Biotechnology, vol. 6, pp. 437-451, 1995, and 
algorithms can be implemented in program packages such as Homology 95.0 (in the 
program Insight II, available from Biosym/MSI, San Diego, C A). Use of Homology 95.0 
requires an alignment of an amino acid sequence of a known structure having a known 
5 three dimensional structure with an amino acid sequence of a target structure to be 
modeled. The alignment can be a pairwise alignment or a multiple sequence alignment 
including other related sequences (for example, using the method generally described by 
Rost, Meth Enzymol, vol. 266, pp. 525-539, 1996) to improve accuracy. Structurally 
conserved regions can be identified by comparing related structural features, or by 

10 examining the degree of sequence homology between the known structure and the target 
structure. Certain coordinates for the target structure are assigned using known structures 
from the known structure. Coordinates for other regions of the target structure can be 
generated from fragments obtained from known structures such as those found in the 
Protein Data Bank maintained by Brookhaven National Laboratory, Upton, NY. 

15 Conformation of side chains of the target structure can be assigned with reference to what 
is sterically allowable and using a library of rotamers and their frequency of occurrence 
(as generally described in Ponder and Richards, J. Mol Biol., vol. 193, pp. 775-791, 
1987). The resulting model of the target structure, can be refined by molecular mechanics 
(such as embodied in the program Discover, available from Biosym/MSI) to ensure that 

20 the model is chemically and conformationally reasonable. 

According to the present invention, an epimerase that has a nucleic acid sequence 
that is homologous at the primary structure level (i.e., is a homologue of) to a nucleic acid 
sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 
galactose 4-epimerase includes any epimerase encoded by a nucleic acid sequence that is 

25 at least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to a nucleic acid sequence 
encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4- 
epimerase, and preferably to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ED NO:7 or SEQ ID NO:9. 

30 Similarly, an epimerase that has an amino acid sequence that is homologous to an amino 
acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 



MSDOCID: <WQ 9964618A1 I > 



WO 99/64618 PCT/US99/11576 

34 

> 

galactose 4-epimerase includes any epimerase having an amino acid sequence that is at 
least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to an amino acid sequence 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4- 
5 epimerase, and preferably to an amino acid sequence selected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO: 10. 

According to one embodiment of the present invention, homology or percent 
identity between two or more nucleic acid or amino acid sequences is performed using 
methods known in the art for aligning and/or calculating percentage identity. To compare 

1 0 the homology/percent identity between two or more sequences as set forth above, for 
example, a module contained within DNASTAR (DNASTAR, Inc., Madison, Wisconsin) 
can be used. In particular, to calculate the percent identity between two nucleic acid or 
amino acid sequences, the Lipman-Pearson method, provided by the MegAlign module 
within the DNASTAR program, is preferably used, with the following parameters, also 

1 5 referred to herein as the Lipman-Pearson standard default parameters: 

(1) Ktuple = 2; 

(2) Gap penalty = 4; 

Using the Lipman-Pearson method with these parameters, for example, the percent 
20 identity between the amino acid sequence for K coli GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (SEQ ID NO:4) and human GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (FX) (SEQ ID NO:6) is 27.7%, which is comparable to the 27% 
identity described for these enzymes in Tonetti et al., 1998, Acta Cryst D54:684-686. 
According to another embodiment of the present invention, to align two or more 
25 nucleic acid or amino acid sequences, for example to generate a consensus sequence or 
evaluate the similarity at various positions between such sequences, a CLUSTAL 
alignment program (e.g., CLUSTAL, CLUSTAL V, CLUSTAL W), also available as a 
module within the DNASTAR program, can be used using the following parameters, also 
referred to herein as the CLUSTAL standard default parameters: 
30 Multiple Alig nment Parameters fi e., for more than 2 sequences^: 

(1) Gap penalty = 10; 
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(2) Gap length penalty =10; 

Pairwise Alignment Parameters (i.e., for two sequences): 

(1) Ktuple=l; 

(2) Gap penalty = 3; 
5 (3) Window = 5; 

(4) Diagonals saved = 5. 

According to the present invention, a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase can be a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from any 
organism, including Arabidopsis thaliana y Escherichia coli, and human. A nucleic acid 

10 sequence encoding a GDP-4-keto-6-deoxy-D~mannose epimerase/reductase from 
Arabidopsis ihaliana is represented herein by SEQ ID NO:l. SEQ ID NO:l encodes a 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence 
represented herein as SEQ ID NO:2. A nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase from Escherichia coli is represented herein by 

15 SEQ ID NO:3. SEQ ID NO:3 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase having an amino acid sequence represented herein as SEQ ID NO:4. A nucleic 
acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from 
homo sapiens is represented herein by SEQ ID NO:5. SEQ ID NO:5 encodes a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented 

20 herein as SEQ ID NO:6. 

According to the present invention, a UDP-galactose 4-epimerase can be a UDP- 
galactose 4-epimerase from any organism, including Escherichia coli and human. A 
nucleic acid sequence encoding a UDP-galactose 4-epimerase from Escherichia coli is 
represented herein by SEQ ID N0.7. SEQ ID NO:7 encodes a UDP-galactose 4- 

25 epimerase having an amino acid sequence represented herein as SEQ ED NO:8. A nucleic 
acid sequence encoding a UDP-galactose 4-epimerase from homo sapiens is represented 
herein by SEQ ID NO:9. SEQ ID NO:9 encodes a UDP-galactose 4-epimerase having 
an amino acid sequence represented herein as SEQ ID NO: 10. 

In a preferred embodiment, an epimerase encompassed by the present invention 

30 has an amino acid sequence that aligns with the amino acid sequence of SEQ ID NO: 1 1, 
for example using a CLUSTAL alignment program, wherein amino acid residues in the 
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amino acid sequence of the epimerase align with 100% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO: 11, and preferably at least about 75% of non-Xaa 
residues in SEQ ID NO:l 1, and more preferably, at least about 90% of non-Xaa residues 
in SEQ ID NO: 11, and even more preferably 100% of non-Xaa residues in SEQ ID 
5 NO: 11. The percent identity of residues aligning with 100% identity with non-Xaa 
residues can be simply calculated by dividing the number of 100% identical matches at 
non-Xaa residues in SEQ ID NO: 1 1 by the total number of non-Xaa residues in SEQ ID 
NO: 11. A preferred nucleic acid sequence encoding an epimerase encompassed by the 
present invention include a nucleic acid sequence encoding an epimerase having an amino 

10 acid sequence with the above described identity to SEQ ID NO: 11. Such an alignment 
using a CLUSTAL alignment program is based on the same parameters as previously 
disclosed herein. SEQ ID NO: 11 represents a consensus amino acid sequence of an 
epimerase which was derived by aligning at least portions of amino acid sequences SEQ 
ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, as described in Somers et al., 1998, Structure 

15 6:1601-1612, and can be approximately duplicated using CLUSTAL. 

In another embodiment, an epimerase encompassed by the present invention 
includes an epimerase that has a catalytic site which includes amino acid residues: serine, 

positions in the epimerase amino acid sequence which align using a CLUSTAL alignment 
20 program with positions Serl05, Tyrl34 and Lysl38 of consensus sequence SEQ ID 
NO: 11, with positions Serl09, Tyrl38 and Lysl42 of sequence SEQ ID NO:2, with 
positions Serl07, Tyrl36 and Lysl40 of SEQ ID NO:4, with positions Serl 14, Tyrl43 
and Lysl47 of sequence SEQ ID NO:6, with positions Serl24, Tyrl49 and Lysl53 of 
sequence SEQ ID NO:8 or with positions Serl32, Tyrl57 and Lysl61 of sequence SEQ 
25 ID NO: 10. 

In another embodiment, an epimerase that has an amino acid sequence that is 
homologous to an amino acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase includes any epimerase that has an amino acid motif: Gly-Xaa-Xaa- 
Qy-Xaa-Xaa-CHy, which is found, for example in positions 8 through 14 of the consensus 
30 sequence SEQ ID NO:l 1, in positions 12 through 18 of SEQ ID NO:2, in positions 10 
through 16 of SEQ ID NO:4, in positions 14 through 20 of SEQ ID NO: 6, in positions 
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7 through 13 of SEQ IDNO:8, and in positions 9 through 15 of SEQ ID NO:10. Such 
a motif can be identified by its alignment with the same motif in the above-identified amino 
acid sequences using a CLUSTAL alignment program. Preferably, such motif is located 
within the first 25 N-terminal amino acids of the amino acid sequence of the epimerase. 
5 In yet another embodiment, an epimerase encompassed by the present invention 

includes an epimerase that has a substrate binding site which includes amino acid residues 
that align using a CLUSTAL alignment program with at least 50% of amino acid 
positions Asnl77, Serl78, Argl87, Arg209, Lys283, Asnl65, Serl07, Serl08, Cysl09, 
Asnl33, Tyrl36 and Ksl79 of SEQ ID NO:4. Alignment with positions Serl07, Tyrl36, 

10 Asnl65, Arg209, is preferably with 100% identity (i.e., exact match of residue, under 
parameters for alignment). 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention comprises at least 4 contiguous amino acid residues having 100% 
identity with at least 4 contiguous amino acid residues of an amino acid sequence selected 

15 from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ 
ID NO: 10, as determined using a Lipman-Pearson method with Lipman-Pearson standard 
default parameters or by comparing an alignment using a CLUSTAL program with 
CLUSTAL standard default parameters. According to the present invention, the term 
"contiguous" means to be connected in an unbroken sequence. For a first sequence to 

20 have "100% identity" with a second sequence means that the first sequence exactly 
matches the second sequence with no gaps between nucleotides or amino acids. 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention is encoded by a nucleic acid sequence that comprises at least 12 
contiguous nucleic acid residues having 100% identity with at least 12 contiguous nucleic 

25 acid residues of a nucleic acid sequence selected from the group of SEQ ID NO:l, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO: 10, as determined using a 
Lipman-Pearson method with Lipman-Pearson standard default parameters or by 
comparing an alignment using a CLUSTAL program with CLUSTAL standard default 
parameters. 

30 In another embodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that hybridizes under stringent 
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hybridization conditions to a nucleic acid sequence selected from the group of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. As used herein, 
stringent hybridization conditions refer to standard hybridization conditions under which 
nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard 
5 conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid, is 
incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In 
addition, formulae to calculate the appropriate hybridization and wash conditions to 
achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, 

10 for example, in Meinkoth et al., 1984, Anal Biochem. 138, 267-284; Meinkoth et al., 
ibid., is incorporated by reference herein in its entirety. 

More particularly, stringent hybridization and washing conditions, as referred to 
herein, refer to conditions which permit isolation of nucleic acid molecules having at least 
about 70% nucleic acid sequence identity with the nucleic acid molecule being used to 

15 probe in the hybridization reaction, more particularly at least about 75%, and most 
particularly at least about 80%. Such conditions will vary, depending on whether 
DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for 
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embodiments, stringent hybridization conditions for DNA:DNA hybrids include 

20 hybridization at an ionic strength of 6X SSC (0.9 M Na + ) at a temperature of between 
about 20°C and about 35°C, more preferably, between about 28°C and about 40°C, and 
even more preferably, between about 35°C and about 45 °C. In particular embodiments, 
stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic 
strength of 6X SSC (0.9 M Na + ) at a temperature of between about 30°C and about 

25 45°C, more preferably, between about 38°C and about 50°C, and even more preferably, 
between about 45 °C and about 55 °C. These values are based on calculations of a melting 
temperature for molecules larger than about 100 nucleotides, 0% formamide and a G + 
C content of about 40%. Alternatively, T m can be calculated empirically as set forth in 
Sambrook et al., supra, pages 9.31 to 9.62. 

30 In another embodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that comprises a nucleic acid 
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sequence selected from the group of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7 or SEQ ID NO:9 or a fragment thereof wherein the fragment encodes a protein 
that is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose, such 
as under physiological conditions. In another embodiment, an epimerase encompassed by 
5 the present invention comprises an amino acid sequence selected from the group of SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10 or a fragment 
thereof) wherein the fragment is capable of catalyzing the conversion of GDP-D-mannose 
to GDP-L-galactose. It is to be understood that the nucleic acid sequence encoding the 
amino acid sequences identified herein can vary due to degeneracies. As used herein, 

10 nucleotide degeneracies refers to the phenomenon that one amino acid can be encoded by 
different nucleotide codons. 

One embodiment of the present invention relates to a method to identify an 
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Preferably, 
such a method is useful for identifying the GDP-D-mannose:GDP-L-galactose epimerase 

15 which catalyzes the conversion of GDP-D-mannose to GDP-L-galactose in the 
endogenous (i.e., naturally occurring L-ascorbic acid biosynthetic pathway of 
microorganisms and/or plants). Such a method can include the steps of: (a) contacting a 
source of nucleic acid molecules with an oligonucleotide at least about 12 nucleotides in 
length under stringent hybridization conditions, wherein the oligonucleotide is identified 

20 by its ability to hybridize under stringent hybridization conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ED NO:3 and SEQ 
ID NO:5; and, (b) identifying nucleic acid molecules from the source of nucleic acid 
molecules which hybridize under stringent hybridization conditions to the oligonucleotide. 
Nucleic acid molecules identified by this method can then be isolated from the source 

25 using standard molecular biology techniques. Preferably, the source of nucleic acid 
molecules is obtained from a microorganism or plant that has an ascorbic acid production 
pathway. Such a source of nucleic acid molecules can be any source of nucleic acid 
molecules which can be isolated from an organism and/or which can be screened by 
hybridization with an oligonucleotide such as a probe or a PGR primer. Such sources 

30 include genomic and cDNA libraries and isolated RNA. 
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In order to screen cDNA libraries from different organisms and to isolate nucleic 
acid molecules encoding enzymes such as the GDP-D-mannose:GDP-L~galactose 
epimerase and related epimerases, one can use any of a variety of standard molecular and 
biochemical techniques. For example, oligonucleotide primers, preferably degenerate 
5 primers, can be designed using the most conserved regions of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase nucleic acid sequence, and such primers can be used in a 
polymerase chain reaction (PCR) protocol to amplify the same or related epimerases, 
including the GDP-D-mannose:GDP-L-galactose epimerase from the ascorbic acid 
pathway, from nucleic acids (e.g., genomic or cDNA libraries) isolated from a desired 

10 organism (e.g., a microorganism or plant having an L-ascorbic acid pathway). Similarly, 
oligonucleotide probes can be designed using the most conserved regions of a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence and such probe can 
be used to identify and isolate nucleic acid molecules, such as from a genomic or cDNA 
library, that hybridize under conditions of low, moderate, or high stringency with the 

15 probe. 

Alternatively, the GDP-D-mannose:GDP-L-galactose epimerase can be purified 
from an organism such as Prototheca, the N-terminal amino acid sequence can be 
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can be used to design degenerate primers for amplifying a gene fragment from the 

20 organism cDNA This fragment would then be used to probe the cDNA library, and 
subsequently fragments that hybridize to the probe would be cloned in that organism or 
another suitable production organism. There is ample precedent for plant enzymes being 
expressed in an active form in bacteria, such as £. coli. Alternatively, yeast are also a 
suitable candidate for developing a heterologous system for L-ascorbic acid production. 

25 As discussed above in general for increasing the action of an enzyme in the L- 

ascorbic acid pathway according to the present invention, in one embodiment of the 
present invention, the action of an epimerase that catalyzes the conversion of GDP-D- 
mannose to GDP-L-galactose is increased by amplification of the expression (i.e., 
overexpression) of such an epimerase. Overexpression of an epimerase can be 

30 accomplished, for example, by introduction of a recombinant nucleic acid molecule 
encoding the epimerase. It is preferred that the gene encoding an epimerase according to 
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the present invention be cloned under control of an artificial promoter. The promoter can 
be any suitable promoter that will provide a level of epimerase expression required to 
maintain a sufficient level of L-ascorbic acid in the production organism. Preferred 
promoters are constitutive (rather than inducible) promoters, since the need for addition 
5 of expensive inducers is therefore obviated. The gene dosage (copy number) of a 
recombinant nucleic acid molecule according to the present invention can be varied 
according to the requirements for maximum product formation. In one embodiment, the 
recombinant nucleic acid molecule encoding an epimerase according to the present 
invention is integrated into the chromosome of the microorganism. 

10 It is another embodiment of the present invention to provide a microorganism 

having one or more epimerases according to the present invention with improved affinity 
for its substrate. An epimerase with improved affinity for its substrate can be produced 
by any suitable method of genetic modification or protein engineering. For example, 
computer-based protein engineering can be used to design an epimerase protein with 

15 greater stability and better affinity for its substrate. See for example, Maulik et al., 1997, 
Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., 
which is incorporated herein by reference in its entirety. 

As noted above, in the method for production of L-ascorbic acid of the present 
invention, a microorganism having a genetically modified L-ascorbic acid production 

20 pathway is cultured in a fermentation medium for production of L-ascorbic acid. An 
appropriate, or effective, fermentation medium refers to any medium in which a genetically 
modified microorganism of the present invention, when cultured, is capable of producing 
L-ascorbic acid. Such a medium is typically an aqueous medium comprising assimilable 
carbon, nitrogen and phosphate sources. Such a medium can also include appropriate 

25 salts, minerals, metals and other nutrients. One advantage of genetically modifying a 
microorganism as described herein is that although such genetic modifications can 
significantly alter the production of L-ascorbic acid, they can be designed such that they 
do not create any nutritional requirements for the production organism. Thus, a minimal- 
salts medium containing glucose as the sole carbon source can be used as the fermentation 

30 medium. The use of a minimal-salts-glucose medium for the L-ascorbic acid fermentation 
will also facilitate recovery and purification of the L-ascorbic acid product. 
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In one mode of operation of the present invention, the carbon source 
concentration, such as the glucose concentration, of the fermentation medium is 
monitored during fermentation. Glucose concentration of the fermentation medium can 
be monitored using known techniques, such as, for example, use of the glucose oxidase 
5 enzyme test or high pressure liquid chromatography, which can be used to monitor 
glucose concentration in the supernatant, e.g., a cell-free component of the fermentation 
medium. As stated previously, the carbon source concentration should be kept below the 
level at which cell growth inhibition occurs. Although such concentration may vary from 
organism to organism, for glucose as a carbon source, cell growth inhibition occurs at 

10 glucose concentrations greater than at about 60 g/L, and can be determined readily by 
trial. Accordingly, when glucose is used as a carbon source the glucose concentration in 
the fermentation medium is maintained in the range of from about 1 g/L to about 100 g/L, 
more preferably in the range of from about 2 g/L to about 50 g/L, and yet more preferably 
in the range of from about 5 g/L to about 20 g/L. Although the carbon source 

15 concentration can be maintained within desired levels by addition of) for example, a 
substantially pure glucose solution, it is preferred to maintain the carbon source 
concentration of the fermentation medium by addition of aliquots of the original 
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desirable because the concentrations of other nutrients in the medium (e.g. the nitrogen 
20 and phosphate sources) can be maintained simultaneously. Likewise, the trace metals 
concentrations can be maintained in the fermentation medium by addition of aliquots of 
the trace metals solution. 

In an embodiment of the fermentation process of the present invention, a 
fermentation medium is prepared as described above. This fermentation medium is 
25 inoculated with 

an actively growing culture of genetically modified microorganisms of the present 
invention in an amount sufficient to produce, after a reasonable growth period, a high cell 
density. Typical inoculation cell densities are within the range of from about 0.1 g/L to 
about 15 g/L, preferably from about 0.5 g/L to about 10 g/L and more preferably from 
30 about 1 g/L to about 5 g/L, based on the diy weight of the cells. The cells are then grown 
to a cell density in the range of from about 10 g/L to about 100 g/L preferably from about 
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20 g/L to about 80 g/L, and more preferably from about 50 g/L to about 70 g/L. The 
residence times for the microorganisms to reach the desired cell densities during 
fermentation are typically less than about 200 hours, preferably less than about 120 hours, 
and more preferably less than about 96 hours. 
5 The microorganisms useful in the method of the present invention can be cultured 

in conventional fermentation modes, which include, but are not limited to, batch, fed- 
batch, and continuous. It is preferred, however, that the fermentation be carried out in 
fed-batch mode. In such a case, during fermentation some of the components of the 
medium are depleted. It is possible to initiate fermentation with relatively high 

10 concentrations of such components so that growth is supported for a period of time before 
additions are required. The preferred ranges of these components are maintained 
throughout the fermentation by making additions as levels are depleted by fermentation. 
Levels of components in the fermentation medium can be monitored by, for example, 
sampling the fermentation medium periodically and assaying for concentrations. 

IS Alternatively, once a standard fermentation procedure is developed, additions can be made 
at timed intervals corresponding to known levels at particular times throughout the 
fermentation. As will be recognized by those in the art, the rate of consumption of 
nutrient increases during fermentation as the cell density of the medium increases. 
Moreover, to avoid introduction of foreign microorganisms into the fermentation medium, 

20 addition is performed using aseptic addition methods, as are known in the art. In addition, 
a small amount of anti-foaming agent may be added during the fermentation. 

The present inventors have determined that high levels of magnesium in the 
fermentation medium inhibits the production of L-ascorbic acid due to repression of 
enzymes early in the production pathway, although enzymes late in the pathway (i.e., from 

25 L-galactose to L-ascorbic acid) are not negatively affected (See Examples). Therefore, 
in a preferred embodiment of the method of the present invention, the step of culturing 
is carried out in a fermentation medium that is magnesium (Mg 2 *) limited. Even more 
preferably, the fermentation is magnesium limited during the cell growth phase. 
Preferably, the fermentation medium comprises less than about 0.5 g/L of Mg 2 * during the 

30 cell growth phase of fermentation, and even more preferably, less than about 0.2 g/L of 
Mg 2 *, and even more preferably, less than about 0. 1 g/L of Mg 2+ . 
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The temperature of the fermentation medium can be any temperature suitable for 
growth and ascorbic acid production, and may be modified according to the growth 
requirements of the production microorganism used. For example, prior to inoculation 
of the fermentation medium with an inoculum, the fermentation medium can be brought 
5 to and maintained at a temperature in the range of from about 20°C to about 45°C, 
preferably to a temperature in the range of from about 25°C to about 40°C, and more 
preferably in the range of from about 30°C to about 38°C. 

It is a further embodiment of the present invention to supplement and/or control 
other components and parameters of the fermentation medium, as necessary to maintain 

1 0 and/or enhance the production of L-ascoibic acid by a production organism. For example, 
in one embodiment, the pH of the fermentation medium is monitored for fluctuations in 
pH. In the fermentation method of the present invention, the pH is preferably maintained 
at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7.0. In 
the method of the present invention, if the starting pH of the fermentation medium is pH 

15 7.0, the pH of the fermentation medium is monitored for significant variations from pH 
7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide. In a 
preferred embodiment of the present invention, genetically modified microorganisms 

microorganisms include, for example, microalgae of the genera Prototheca and Chlorella 
20 (See U.S. Patent No. 5,792,63 1, ibid, and U.S. Patent No. 5,900,370, ibid). 

The production of ascorbic acid by culturing acid-tolerant microorganisms 
provides significant advantages over known ascorbic acid production methods. One such 
advantage is that such organisms are acidophilic, allowing fermentation to be carried out 
under low pH conditions, with the fermentation medium pH typically less than about 6. 
25 Below this pH, extracellular ascorbic acid produced by the microorganism during 
fermentation is relatively stable because the rate of oxidation of ascorbic acid in the 
fermentation medium by oxygen is reduced. Accordingly, high productivity levels can be 
obtained for producing L-ascorbic acid with acid-tolerant microorganisms according to 
the methods of the present invention. In addition, control of the dissolved oxygen content 
30 to very low levels to avoid oxidation of ascorbic acid is unnecessary. Moreover, this 
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advantage allows for the use of continuous recovery methods because extracellular 
medium can be treated to recover the ascorbic acid product. 

Thus, the present method can be conducted at low pH when acid-tolerant 
microorganisms are used as production organisms. The benefit of this process is that at 
5 low pH, extracellular ascorbic acid produced by the organism is degraded at a reduced 
rate than if the fermentation medium was at higher pH. For example, prior to inoculation 
of the fermentation medium with an inoculum, the pH of the fermentation medium can be 
adjusted, and further monitored during fermentation. Typically, the pH of the 
fermentation medium is brought to and maintained below about 6, preferably below 5.5, 

10 and more preferably below about 5. The pH of the fermentation medium can be 
controlled by the addition of ammonia to the fermentation medium. In such cases when 
ammonia is used to control pH, it also conveniently serves as a nitrogen source in the 
fermentation medium. 

The fermentation medium can also be maintained to have a dissolved oxygen 

15 content during the course of fermentation to maintain cell growth and to maintain cell 
metabolism for L-ascorbic acid formatioa The oxygen concentration of the fermentation 
medium can be monitored using known methods, such as through the use of an oxygen 
probe electrode. Oxygen can be added to the fermentation medium using methods known 
in the art, for example, through agitation and aeration of the medium by stirring or 

20 shaking. Preferably, the oxygen concentration in the fermentation medium is in the range 
of from about 20% to about 100% of the saturation value of oxygen in the medium based 
upon the solubility of oxygen in the fermentation medium at atmospheric pressure and at 
a temperature in the range of from about 30 °C to about 40 °C. Periodic drops in the 
oxygen concentration below this range may occur during fermentation, however, without 

25 adversely affecting the fermentation. 

The genetically modified microorganisms of the present invention are engineered 
to produce significant quantities of extracellular L-ascorbic acid. Extracellular L-ascorbic 
acid can be recovered from the fermentation medium using conventional separation and 
purification techniques. For example, the fermentation medium can be filtered or 

30 centrifuged to remove microorganisms, cell debris and other particulate matter, and L- 
ascorbic acid can be recovered from the cell-free supernate by conventional methods, such 
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as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane 
separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and 
crystallization. 

One such example of L-ascorbic acid recovery is provided in U.S. Patent No. 
5 4,595,659 by Cayle, incorporated herein in its entirety be reference, which discloses the 
isolation of L-ascorbic acid from an aqueous fermentation medium by ion exchange resin 

■ 

adsorption and elution, which is followed by decoloration, evaporation and crystallization. 
Further, isolation of the structurally similar isoascorbic acid from fermentation medium 
by a continuous multi-bed extraction system of anion-exchange resins is described by K. 
10 Shimizu, Agr. Biol. Chenu 31:346-353 (1967), which is incorporated herein in its entirety 
by reference. 

Intracellular L-ascorbic acid produced in accordance with the present invention 
can also be recovered and used in a variety of applications. For example, cells from the 
microorganisms can be lysed and the ascorbic acid which is released can be recovered by 
15 a variety of known techniques. Alternatively, intracellular ascorbic acid can be recovered 
by washing the cells to extract the ascorbic acid, such as through diafiltration. 

Development of a microorganism with enhanced ability to produce L-ascorbic acid 
i^gengjjigl^^ 

molecular genetic techniques, and particularly, recombinant technology (genetic 
engineering). In general, the strategy for creating a microorganism with enhanced L- 
ascorbic acid production is to (1) inactivate or delete at least one, and preferably more 
than one of the competing or inhibitory pathways in which production of L-ascorbic acid 
is negatively affected (e.g., inhibited), and more significantly to (2) amplify the L-ascorbic 
acid production pathway by increasing the action of a gene(s) encoding an enzyme(s) 
involved in the pathway. 

In one embodiment, the strategy for creating a microorganism with enhanced L- 
ascorbic acid production is to amplify the L-ascorbic acid production pathway by 
increasing the action of GDP-D-mannose:GDP-L-galactose epimerase, as discussed 
above. Such strategy includes genetically modifying the endogenous GDP-D- 
mannose:GDP-L-galactose epimerase such that L-ascorbic acid production is increased, 
and/or expressing/overexpressing a recombinant epimerase that catalyzes the conversion 
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of GDP-D-mannose to GDP-L-galactose, which includes expression of recombinant GDP- 
D-mannose:GDP-L-galactose epimerase and/or homologues thereof, and of other 
recombinant epimerases such as GDP-4-keto-6-deoxy-D-mannose epimerase reductase 
and epimerases that share structural homology with such epimerase as discussed in detail 
5 above. 

It is to be understood that a production organism can be genetically modified by 
recombinant technology in which a nucleic acid molecule encoding a protein involved in 
the L-ascorbic acid production pathway disclosed herein is transformed into a suitable 
host which is a different member of the plant kingdom from which the nucleic acid 
10 molecule was derived. For example, it is an embodiment of the present invention that a 
recombinant nucleic acid molecule encoding a GDP-D-mannose:GDP-L-galactose 
epimerase from a higher plant can be transformed into a microalgal host in order to 
overexpress the epimerase and enhance production of L-ascorbic acid in the microalgal 
production organism. 

15 As previously discussed herein, in one embodiment, a genetically modified 

microorganism can be a microorganism in which nucleic acid molecules have been deleted, 
inserted or modified, such as by insertion, deletion, substitution, and/or inversion of 
nucleotides, in such a manner that such modifications provide the desired effect within the 
microorganism. A genetically modified microorganism is preferably modified by 

20 recombinant technology, such as by introduction of an isolated nucleic acid molecule into 
a microorganism. For example, a gefietically modified microorganism can be transfected 
with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein 
for which increased expression is desired. The transfected nucleic acid molecule can 
remain extrachromosomal or can integrate into one or more sites within a chromosome 

25 of the transfected (i.e., recombinant) host cell in such a manner that its ability to be 
expressed is retained. Preferably, once a host cell of the present invention is transfected 
with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell 
genome. A significant advantage of integration is that the nucleic acid molecule is stably 
maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule 

30 is operatively linked to a transcription control sequence (described below) which can be 
induced to control expression of the nucleic acid molecule. 
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A nucleic acid molecule can be integrated into the genome of the host cell either 
by random or targeted integratioa Such methods of integration are known in the art. For 
example, an£ coli strain ATCC 47002 contains mutations that confer upon it an inability 
to maintain plasmids which contain a CoIEl origin of replication. When such plasmids are 
5 transferred to this strain, selection for genetic markers contained on the plasmid results 
in integration of the plasmid into the chromosome. This strain can be transformed, for 
example, with plasmids containing the gene of interest and a selectable marker flanked 
by the 5'- and 3 '-termini of the E coli IacZ gene. The lacZ sequences target the incoming 
DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus 
10 replaces the intact lacZ gene, which encodes the enzyme P-galactosidase, with a partial 
lacZ gene interrupted by the gene of interest. Successful integrants can be selected for 
p-galactosidase negativity. 

A genetically modified microorganism can also be produced by introducing nucleic 
acid molecules into a recipient cell genome by a method such as by using a transducing 
IS bacteriophage. The use of recombinant technology and transducing bacteriophage 
technology to produce several different genetically modified microorganism of the present 
invention is known in the art. 

?3cconImg B i^tn^^rSs^^ 
mannose:GDP-L-galactose epimerase gene, includes all nucleic acid sequences related to 
20 a natural epimerase gene such as regulatory regions that control production of the 
epimerase protein encoded by that gene (such as, but not limited to, transcription, 
translation or post-translation control regions) as well as the coding region itself. In 
another embodiment, a gene, for example the GDP-D-mannose:GDP-L-galactose 
epimerase gene, can be an allelic variant that includes a similar but not identical sequence 
25 to the nucleic acid sequence encoding a given GDP-D-mannose:GDP-L-galactose 
epimerase gene. An allelic variant of a GDP-D-mannose:GDP-L-galactose epimerase 
gene which has a given nucleic acid sequence is a gene that occurs at essentially the same 
locus (or loci) in the genome as the gene having the given nucleic acid sequence, but 
which, due to natural variations caused by, for example, mutation or recombination, has 
30 a similar but not identical sequence. Allelic variants typically encode proteins having 
similar activity to that of the protein encoded by the gene to which they are being 
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compared. Allelic variants can also comprise alterations in the 5 1 or 3 1 untranslated 
regions of the gene (e.g., in regulatory control regions). Allelic variants are well known 
to those skilled in the art and would be expected to be found within a given 
microorganism or plant and/or among a group of two or more microorganisms or plants. 
5 In accordance with the present invention, an isolated nucleic acid molecule is a 

nucleic acid molecule that has been removed from its natural milieu (i.e., that has been 
subject to human manipulation). As such, "isolated 1 * does not reflect the extent to which 
the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include 
DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a 
10 practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid 
molecule can include a portion of a gene, an entire gene, or multiple genes, or portions 
thereof. 

An isolated nucleic acid molecule of the present invention can be obtained from 
its natural source either as an entire (i.e., complete) gene or a portion thereof capable of 

15 forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be 
produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) 
amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include 
natural nucleic acid molecules and homologues thereof, including, but not limited to, 
natural allelic variants and modified nucleic acid molecules in which nucleotides have been 

20 inserted, deleted, substituted, and/or inverted in such a manner that such modifications 
provide the desired effect within the microorganism. A structural homologue of a nucleic 
acid sequence has been described in detail above. Preferably, a homologue of a nucleic 
acid sequence encodes a protein which has an amino acid sequence that is sufficiently 
similar to the natural protein amino acid sequence that a nucleic acid sequence encoding 

25 the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic 
acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid 
strand encoding the natural protein amino acid sequence). A nucleic acid molecule 
homologue encodes a protein homologue. As used herein, a homologue protein includes 
proteins in which amino acids have been deleted (e.g., a truncated version of the protein, 

30 such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by 
glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, 
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amidation and/or addition of glycosylphosphatidyl inositol) in such a manner that such 
modifications provide the desired effect on the protein and/or within the microorganism 
(e.g., increased or decreased action of the protein). 

A nucleic add molecule homologue can be produced using a number of methods 
5 known to those skilled in the art (see, for example, Sambrook et al., ibid). For example, 
nucleic acid molecules can be modified using a variety of techniques including, but not 
limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site- 
directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, 
restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, 

10 PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, 
synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture 
of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologies 
can be selected from a mixture of modified nucleic acids by screening for the function of 
the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene. 

15 Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic 

acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of 
nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, 

capable of encoding a gene involved in an L-ascorbic acid production pathway. 

20 Knowing the nucleic acid sequences of certain nucleic acid molecules of the 

present invention allows one skilled in the art to, for example, (a) make copies of those 
nucleic add molecules and/or (b) obtain nucldc add molecules including at least a portion 
of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, 
full-length coding regions, regulatory control sequences, truncated coding regions). Such 

25 nucleic add molecules can be obtained in a variety of ways including traditional cloning 
techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR 
amplification of appropriate libraries or DNA using oligonucleotide primers. Preferred 
libraries to screen or from which to amplify nucleic acid molecule include bacterial and 
yeast genomic DNA libraries, and in particular, microalgal genomic DNA libraries. 

30 Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., 
ibid 
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The present invention includes a recombinant vector, which includes at least one 
isolated nucleic acid molecule of the present invention, inserted into any vector capable 
of delivering the nucleic acid molecule into a host microorganism of the present invention. 
Such a vector can contain nucleic acid sequences that are not naturally found adjacent to 
5 the isolated nucleic acid molecules to be inserted into the vector. The vector can be either 
RNA or DNA and typically is a plasmid. Recombinant vectors can be used in the cloning, 
sequencing, and/or otherwise manipulating of nucleic acid molecules. One type of 
recombinant vector, referred to herein as a recombinant molecule and described in more 
detail below, can be used in the expression of nucleic acid molecules. Preferred 

10 recombinant vectors are capable of replicating in a transformed bacterial cells, yeast cells, 
and in particular, in microalgal cells. 

Transformation of a nucleic acid molecule into a cell can be accomplished by any 
method by which a nucleic acid molecule can be inserted into the cell. Transformation 
techniques include, but are not limited to, transfection, electroporation, microinjection and 

15 biolistics. 

A recombinant cell is preferably produced by transforming a host cell with one or 
\ more recombinant molecules, each comprising one or more nucleic acid molecules 
operatively linked to an expression vector containing one or more transcription control 
sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule 

20 into an expression vector in a manner such that the molecule is able to be expressed when 
transformed into a host cell. As usfed herein, an expression vector is a DNA or RNA 
vector that is capable of transforming a host cell and of effecting expression of a specified 
nucleic acid molecule. Preferably, the expression vector is also capable of replicating 
within the host cell. In the present invention, expression vectors are typically plasmids. 

25 Expression vectors of the present invention include any vectors that function (i.e., direct 
gene expression) in a yeast host cell, a bacterial host cell, and preferably a microalgal host 
cell. 

Nucleic acid molecules of the present invention can be operatively linked to 
expression vectors containing regulatory sequences such as transcription control 
30 sequences, translation control sequences, origins of replication, and other regulatory 
sequences that are compatible with the recombinant cell and that control the expression 
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of nucleic acid molecules of the present invention. In particular, recombinant molecules 
of the present invention include transcription control sequences. Transcription control 
sequences are sequences which control the initiation, elongation, and termination of 
transcription. Particularly important transcription control sequences are those which 
5 control transcription initiation, such as promoter, enhancer, operator and repressor 
sequences. Suitable transcription control sequences include any transcription control 
sequence that can function in yeast or bacterial cells or preferably, in microalgal cells. A 
variety of such transcription control sequences are known to those skilled in the art. 

It may be appreciated by one skilled in the art that use of recombinant DNA 

10 technologies can improve expression of transformed nucleic acid molecules by 
manipulating, for example, the number of copies of the nucleic acid molecules within a 
host cell, the efficiency with which those nucleic acid molecules are transcribed, the 
efficiency with which the resultant transcripts are translated, and the efficiency of post- 
translational modifications. Recombinant techniques useful for increasing the expression 

15 of nucleic acid molecules of the present invention include, but are not limited to, 
operatively linking nucleic acid molecules to high-copy number plasmids, integration of 
the nucleic acid molecules into the host cell chromosome, addition of vector stability 

promoters, operators, enhancers), substitutions or modifications of translational control 
20 signals, modification of nucleic acid molecules of the present invention to correspond to 
the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use 
of control signals that temporally separate recombinant cell growth from recombinant 
enzyme production during fermentation. The activity of an expressed recombinant protein 
of the present invention may be improved by fragmenting, modifying, or derivatizing 
25 nucleic acid molecules encoding such a protein. 

The following experimental results are provided for the purposes of illustration and 
are not intended to limit the scope of the invention. 
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EXAMPLES 

Example 1 

The present example describes the elucidation of the pathway from glucose to L- 
ascorbic acid through GDP-D-mannose in plants. 
5 Since the present inventors have previously shown that Prototheca makes L- 

ascorbic acid (AA) from glucose, it was worthwhile to examine cultures for some of the 
early conversion products of glucose. In the past, the present inventors had concentrated 
on pathways from glucose to organic adds, based on the published pathway of L-ascorbic 
acid synthesis in animals and proposed pathways in plants. The present inventors 

10 demonstrate herein that the pathway from glucose to L-ascorbic acid involves not organic 
acids, but rather sugar phosphates and nucleotide diphosphate sugars (NDP-sugars). 

Prior to the present invention, it was known that all cells synthesize 
polysaccharides by first forming NDP-sugars. The sugar moiety is then incorporated into 
polymer, while the cleaved NDP is recycled. A variety of polysaccharides are known, and 

15 are usually named based on the relative proportions of the sugar residues in the polymers. 
For example, a "galactomannan" contains mostly galactose, and to a lesser degree, 
mannose residues. The "biopolymer" from Prototheca strains isolated by the present 
inventors was analyzed and found to be 80% D-galactose, 18% rhamnose (D- or L- 
configuration not determined), and 2% L-arabinose. The present inventors provide 

20 evidence herein of how the respective NDP-sugars that make up the Prototheca 
biopolymer are formed, and what correlations exist between L-ascorbic acid synthesis and 
the formation of the NDP-sugar forms of the sugar residues found in the biopolymer. 

The common NDP-sugar UDP-glucose is shown in Fig. 2B. This is formed in 
plants from glucose-l-P by the action of UDP-D-glucose pyrophosphorylase. UDP- 

25 glucose can be epimerized in plants to form UDP-D-galactose, using UDP-D-glucose-4- 
epimerase. UDP-D-galactose can also be formed by phosphorylation of D-galactose by 
galactokinase to form D-galactose-l-P, which can be converted to UDP-D-galactose by 
UDP-D-galactose pyrophosphorylase. These known routes were believed to account for 
the D-galactose in the Prototheca biopolymer. The UDP-L-arabinose can be formed by 

30 known reactions beginning with the oxidation of UDP-D-glucose to UDP-D-glucuronic 
acid (by UDP-D-glucose dehydrogenase), decarboxylation to UDP-D-xylose, and 
epimerization to UDP-L-arabinose. This accounts for the arabinose residues in the 
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biopolymer. UDP-L-rhamnose is known to be formed from UDP-D-glucose, thus all 
three of the sugar moieties in the Prototheca biopolymer can be accounted for by a 
pathway through glucose-l-P and UDP-glucose. Alternatively, if the rhamnose in the 
biopolymer is D-rhamnose, it is not formed via UDP-D-glucose, but by oxidation of GDP- 
5 D-mannose (See Fig. 1). 

GDP-D-rhamnose is formed by converting glucose, in turn, to D-glucose-6-P, D- 
fructose-6-P, D-mannose-6-P, D-mannose-l-P, GDP-D-mannose, and GDP-D-rhamnose. 
It was of interest to the present inventors that this route passes through GDP-D-mannose. 
Exogenous mannose is known to be converted to D-mannose-6-P in plants, and can enter 

10 the path above. D-mannose is converted to L-ascorbic acid by Prototheca cells cultured 
by the present inventors as well or better than glucose (see Example 4). The mechanism 
of conversion, in Chlorella pyrenoidosa , of GDP-D-mannose to GDP-L-galactose by 
GDP-D-mannose:GDP-L-galactose epimerase, has been known for years (See, Barber, 
1971, Arch Biochem. Biophys. 147:619-623, incorporated herein by reference in its 

1 5 entirety). The present inventors have discovered herein that L-galactose and L-galactono- 
y-lactone are rapidly converted to L-ascorbic acid by strains of Prototheca and Chlorella 
pyrenoidosa Prior to the present invention, it was known that L-galactono-y -lactone is 

step were unknown. Based on the published literature and the present experimental 
20 evidence, the present inventors have determined that the L-ascorbic acid biosynthetic 
pathway in plants passes through GDP-D-mannose and involves sugar phosphates and 
NDP-sugars. The proposed pathway is shown in Fig. 1 . Salient points relevant to the 
design and production of genetically modified microorganisms useful in the present 
method include: 

25 1. The enzymes leading from D-glucose to D-fructose-6-P are well known 

enzymes in the first, uncommitted steps of glycolysis. 

2. The enzymes involved in the conversion of D-fructose-6-P to GDP-D- 
mannose have been well characterized in plants, yeast, and bacteria, particularly 
Azotobacter vinelandii and Pseudomonas aeruginosa, which convert GDP-D-mannose 

30 to GDP-D-mannuronic acid, which is the precursor for alginate (See for example, Sa- 
Correia et al., 1987, 7. Bacteriol 169:3224-3231; Koplin et al„ 1992, 1 Bacterial 
174:191-199; Oesterhelt et al., 1996, Plant Science 121:19-27; Feingold et al., 1980, Ihfi 
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Biochemistry of Plants : Vol 3: Carbohydrates, structure and function, P.K. Stampf & E.E. 
Conn, eds., Academic Press, New York, pp. 101-170; Smith et al., 1992, Mol Cell Biol 
12:2924-2930; Boles et al., 1994, Eur, J. Med 220:83-96; Hashimoto et al., 1997, 1 
Biol. Chem. 272:16308-16314, all of which are incorporated herein by reference in their 
5 entirety). 

3. Barber (1971, supra, and 1975) identified in Chlorella pyrenoidosa the 
enzyme activities for the conversion of GDP-D-mannose to GDP-L-galactose and L- 
galactose-l-P. 

4. The present inventors have shown herein the rapid conversion of L- 
10 galactose and L-galactono-y -lactone to L-ascorbic acid by Prototheca cells. 

5. L-galactono-y -lactone and L-galactonic acid can be interconverted in 
solution by changing the pH of the solution; addition of base shifts the equilibrium to L~ 
galactonic add, while addition of acid shifts the equilibrium to the lactone. Cells may have 
an enzymatic means for this conversion in addition to this non-enzymatic route. 

15 6. In plants, GDP-L-fucose is also formed from GDP-D-mannose, presumably 

for incorporation into polysaccharide. Roberts (1971) fed labeled D-mannose to corn root 
tips and found the label in polysaccharide, specifically in the residues of D-mannose, L- 
galactose, and L-fucose. No label was detected in D-glucose, D-galactose, L-arabinose, 
or D-xylose. Prototheca and C. pyrenoidosa cells have the ability to convert L-fucose 

20 (6-deoxy-L-galactose) to a dipyridyl-positive product that was shown by HPLC not to be 

--, > 

L-ascorbic acid. The present inventors believe that it is was the 6-deoxy analog of L- 
ascorbic acid. 

Example 2 

This example shows that inPrototheca, like other plants (Loewus, F.A. 1988. In: 
25 J. Priess (ed.), The Biochemistry of Plants, 14:85-107. New York, Academic Press) and 
the green microalga Chlorella pyrenoidosa (Renstrom, et al, 1983. Plant Sci. Lett. 
28:299-305), ascorbic acid (AA) production from glucose proceeds by a biosynthetic 
pathway that allows retention of the configuration of the carbon skeleton of glucose. 

Cultures of the strain UV77-247 were grown to moderate cell density in shake 
30 flasks with l- 13 C-labeled glucose as 10% of the total glucose (40 g/L). Incubation was 
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as per the standard Mg-limited screen (see Example 3). The culture supernates were 
clarified, deionized to remove salts, lyophilized, and subjected to nuclear magnetic 
resonance (nmr) analysis to determine where in the AA molecule the 13 C was located. In 
each case, approximately 85% of the label was found at the C-l position of AA, with most 
5 of the remaining label at the C-6 position. This strongly indicated that AA is synthesized 
from glucose by a pathway that retains the carbon chain configuration, i. e., C-l of 
glucose becomes C-l of AA. This has typically been observed in plants (Loewus, F.A. 
1988. Ascorbic acid and its metabolic products. In: The Biochemistry of Plants, ed. J. 
Priess, 14:85-107. New York, Academic Press). Animals (Mapson, L.W. and F.A. 

10 Isherwood 1956. Biochem. J. 64:151-157; Loewus, F.A. 1960. J. Biol. Chem. 
235(4):937-939) and protists such as Euglena (Shigeoka, S., etal., 1979. J. Nutr. Sci. 
Vitaminol. 25:299-307), on the other hand, synthesize AA by a pathway that involves the 
inversion of configuration, /. e., C-l of glucose becomes C-6 of AA Demonstration of 
the inversion/non-inversion nature of the pathway was an important step in determining 

15 the pathway of AA biosynthesis since the two types of pathways require different types 
of enzymatic reactions. The label found at C-6 of AA is thought to be due to metabolism 
of glucose and subsequent gluconeogenesis. The metabolism of glucose in glycolysis 

■MMM pEOceedsfthifo^ 

glucose become biochemically equivalent. Hexose phosphates can be regenerated from 
20 the triose phosphates by gluconeogenesis, which is essentially a reversal of the degradative 
pathway. Consequently, metabolism of C-l -labeled glucose to triose phosphates with 
subsequent gluconeogenesis would result in the formation of hexose phosphate molecules 
labeled at either or both C-l and C-6. If those hexose phosphates were precursors to AA, 
one would expect the AA to be similarly labeled. Consistent with this type of "isotopic 
25 mixing" is the observation that sucrose obtained from l- 13 C-labeled glucose was labeled 
at positions 1,6, l 1 and 6\ 

Glucose can also be metabolized by the pentose phosphate pathway, the overall 
balanced equation for which is: 



3 Glucose-6-phosphate -» 2 Fructose-6-phosphate + Glyceraldehyde-3-phosphate + 3 C0 2 
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Based on the known biochemistry, it would then be expected that the label at each 
of the carbons in glucose (Table 1 left column) would appear at the positions for the other 
molecules shown, and that these patterns would be reflected in the AA formed from C-2- 
and C-3-labeled glucose. 

TABLE 1 

Predicted Carbon Labeling of Metabolites of Glucose in the Pentose Phosphate Pathway 



10 



20 



25 



30 



Labeled Glucose 
Carbon 


Position of Labeled Carbon in: 


COj 


F6P(1) 


F6P(2) 


G3P 


1 


+ 








2 




1,3 


1 




3 




2 


2,3 




4 




4 


4 


1 


5 




5 


5 


2 


6 




6 


6 


3 



15 AA recovered from cultures fed glucose labeled at C-2 or C-3 was also analyzed 

for its labeling patterns (Table 2). 

TABLE 2 



Carbon 
Position in AA 


Isotopic enhancement after growth on: 


C-2 labeled glucose 


C-3 labeled glucose 


1 


1.0 


0.4 


2 


10.0 


0.9 


3 


0.5 


9.9 


4 


0 


2.8 


5 


2.2 


0-2 


6 


0 


0 



The data above again suggest a pathway from glucose to AA that proceeds by 
retention of configuration. As in the experiments with C-l labeled glucose, approximately 
one-fifth of the label is present in "mirror image" position to the glucose label (C-5 for C-2 
labeled glucose and C-4 for C-3 labeled glucose), indicating levels of gluconeogenesis 
consistent with those previously observed. 

The small, but significant amount of enhancement observed in other positions is 
consistent with flux through the pentose phosphate pathway. As predicted above, carbon 
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flux through this pathway would result in isotopic enhancement at positions 1 and 3 when 
cells were grown on 2- 13 C glucose and enhancement at position 2 when cells were grown 
on 3- 13 C glucose. This is indeed observed. That there is twice as much enhancement at 
C-l as there is at C-3 after growth on 2- 13 C glucose is also predicted. These data indicate 
5 a small but measurable amount of carbon flux through the pentose phosphate pathway. 

Example 3 

This example shows the methods for generating, screening and isolating mutants 
of Prototheca with altered AA productivities compared to the starting strain ATCC 
75669. 

10 ATCC No. 75669, identified as Prototheca moriformis RSP1385 (unicellular 

green microalga), was deposited on February 8, 1994, with the American Type Culture 
Collection (ATCC), Rockville, Maryland, 20852, USA, under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure. Initial screening of Prototheca species and strains was reported in 
15 U.S. Patent No. 5,900,370, ibid. Table 3 lists the formulations of the media for growth 
and maintenance of the strains. Glucose for fermentors was supplied as glucose 
Fnfi^Hpl^ 

solution is gjven in Table 4. The standard growth temperature was 3 5 °C. All organisms 
were cultured axenically. 



20 



TABLE 3 

Media for Growth and Maintenance of Prototheca Strains 
All quantities are in g/L unless otherwise specified 



Ingredient 


Liquid 


Agar 


Standard 


Mg-limrting 


Slants 


Ferrozine 
Plates 


Standard 
Plates 


Potassium phosphate 
monobasic 


1.3 


1.3 


2.0 


0.27 


2.0 


Potassium phosphate dibasic 


3.8 


3.8 


2.0 


1.4 


2.0 


Trisodium citrate dihydrate 


7.7 


7.7 


2.6 


1.3 


2.6 


Magnesium sulfate 
heptahydrate 


0.40 


0.02 


0.4 


0.01 


0.4 


Ammonium sulfate 


3.7 


3.7 


1.0 


1.0 


1.0 


Trace Metals Solution 


2mL 


2mL 


2mL 


2mL 


2mL 


Ferrous sulfate heptahydrate 


1.5 mg 


4.5 mg 


1.5 mg 




1.5 mg 


Calcium chlorid dihydrate 




025 









25 



30 
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ingreoieni 


Liquid 


Agar 


Standard j 


Mg-limiting 


Slants 


Ferrozine 
Plat s 


Standard 
Plates 


Mangan us sulfate 
monohydrate 




0.08 








Yeast extract 






2.5 






Agar 






15 


15 
(Noble) 


15 


pH before autoclaving 


72 


7.2 


72 


72 


72 


Autoclave, then add 


Copper sulfate, 
pentahydrate, 100g/L 








2mL 




40 g/L Ferrozine In 5 mM 
! phosphate (pH 7.5 final) 








8.8 mL 




Ferric ammonium sulfate 
dodecahydrate, 40 g/L 








3.8 mL 




50% glucose with 25 mg/L 
thiamine HCI 


40 mL 


60 mL 


10 mL 


10 mL 


10 mL 



IS TABLE 4 

Trace Metals Solution 







Cone, of Individ. 


mL Indiv. Stock per 


Compound 


Molecular Weight 


Solutions, g/L 


liter of Working Stock 


Distilled Water 






823 


Hydrochloric Acid 




Cone. 


20 


Cobalt Chloride hexahydrate 


237.9 


24.0 


6.5 


Boric acid 


61.8 


38.1 


24 


Zinc sulfate heptahydrate 


287.5 


35.3 


50 


Manganous sulfate 


169.0 


24.6 


50 


monohydrate 


* ?. 






Sodium molybdate dihydrate 


242.0 


23.8 


2.0 


Calcium chloride dihydrate 


147.0 




11.4g 


Vanadyl sulfate dihydrate 


1 99.0 


10.0 


8.0 


Nickel nitrate hexahydrate 


290.8 


5.0 


8.0 


Sodium selenrte 


173.0 


5.0 


8.0 ] 



30 Mutant isolates were generated by treatment with one or more of the following 

agents: nitrous acid (NA); ethyl methane sulfonate (EMS); or ultraviolet light (UV). 
Typically, glucose-depleted cells grown in standard liquid medium were washed and 
resuspended in 25 mM phosphate buffer, pH 7.2, diluted to approximately \(P 
colony-forming units per mL (cfu/mL), exposed to the mutagen to achieve about 99% kill, 

35 incubated 4-8 hours in the dark, and spread onto standard agar medium, or agar media 
containing differential agents. 
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Some mutant colonies on standard agar medium were picked randomly and 
subcuhured to master plates. Other isolation plates were inverted over chloroform to lyse 
cells on the surface of the colonies and allow them to release AA. Released AA was 
detected by spraying the treated plates with a solution of 2,6-dichrorophenol-indophenol 
5 (1.25 g/L in 70% EtOH). The ability of AA to reduce this blue redox dye to its colorless 
form is the basis for a standard assay of AA (Omaye, et ai, 1979. Meth. Enzymol. 
62:3-11.). Colonies derived from mutagenized cells were saved to master plates for 
further evaluation if their clear halos were significantly larger than the halos typical of the 
other mutants in that group. Other mutagenized cells were spread onto plates containing 

10 an AA detection system incorporated directly into the agar. This system is based on the 
ability of AA to reduce ferric iron to ferrous iron. The compound ferrozine 
(3-(2-pyridyl)-5,6- bis(4-phenylsulfonic acid)-l,2,4-triazine) was present in the agar to 
complex with the ferrous iron and give a violet color reaction. The ferrozine agar 
formulation is shown in Table 3. Colonies giving the darkest color reactions were 

15 master-plated. When screening for non-AA-producing strains (blocked mutants), white 
colonies were chosen against a background of relatively dark colonies. 

For primary screening of tube cultures, cells were inoculated from master plates 

slanted position on a rotary shaker at 300 rpm for four days. After both three and four 
20 days of incubation aliquots were removed for AA assay and cell density determination. 
Those for AA assay were centrifiiged at 1500 x g for 5 min and the resulting supernates 
were removed for either colorimetric assay or high pressure liquid chromatography 
(HPLC). Promising isolates were retested in tube culture. Those passing the tube screen 
were tested in shake flasks. 
25 For secondary screening of flask cultures, cells were inoculated into 50 mL of 

standard flask medium in 250 mL baffled shake flasks, and incubated on a rotary shaker 
at 180 rpm until glucose depletion (24-48 hours). A second series of flasks of 
Mg-sufficient standard medium was inoculated from the first set to a cell density of 0.15 
A« 20 , and incubated for 24 hours. A third series of Mg-limiting flask medium was 
30 inoculated from the second set by a 1/50 dilution and incubated for 96 hours. Flasks were 
sampled for AA analysis and cell density measurements during this time as required. 
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Aliquots for supernatant AA analysis were centrifiiged at 5000 x g for 5 min. Aliquots 
for total whole broth AA analysis were first extracted for 15 min with an equal volume of 
5% trichloroacetic add (TCA) before centrifugation. Aliquots of the resulting supernates 
were removed for either colorimetric assay or HPLC analysis. 
5 For colorimetric assay of AA, a modification of the method of Omaye, et ai 

(1979. Meth. Enzymol. 62:3-1 1) was used. Twenty-five pL aliquots of culture supernates 
were added to wells of 96-well microplates, and 125 \xL of color reagent was added. The 
color reagent consisted of four parts 0.5% aqueous 2,2-dipyridyl and one part 8.3 mM 
ferric ammonium sulfate in 27 % (v/v) o-phosphoric acid, the two components being 

10 mixed immediately before use. After one hour, the absorbance at 520 nm was read. AA 
concentration was calculated by comparison of the absorbances of AA standards. 

HPLC analysis was based on that of Running, et aL y (1994). Supernates were 
chromatographed on a Bio-Rad HPX-87H organic acid column (Bio-Rad Laboratories, 
Richmond, CA) with 13 mM nitric acid as solvent, at a flow rate of 0.7 mL/min at room 

15 temperature. Detection was at either 254 nm using a Waters 441 detector (Millipore 
Corp., Milford, MA), or at 245 nm using a Waters 481 detector. This system can 
distinguish between the L- and D- isomers of AA 

For dry weight determinations of cell density, 5 mL whole broth samples were 
centrifuged at 5000 x g for 5 min, washed once with distilled water, and the pellet was 

20 washed into a tared aluminum weighing pan. Cells were dried for 8-24 h at 105 °C. Cell 
weight was calculated by difference. 

Table 5 shows the abilities of various mutants of Prototheca to synthesize AA, 



TABLE S 

AA Synthesizing Ability of Various Prototheca Mutants in Flask Screen 



Strain 


Specific AA Formation, mg AA per L/Culture A^o, 
during Mg-limited Incubation 


2 Days Incubation 


4 Days Incubation 


ATCC 75669 


22 


35 


EMS13-4 


79 


166 


UV213-1 


0 


0 


UV218-1 


0 


0 


UV244-1 


0 


0 
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Strain 

• 


Specific AA Formation, mg AA per L/Culture P^, 
during Mg-limiled Incubation 


2 Days Incubati n 


4 Days Incubation 


UV244-15 


53 


68 


UV77-247 


56 


83 


UV140-1 


67 


100 


UV 164-6 


91 


131 


NA21-14 


27 


78 


LA/82-21 


0 


0 


UV1 27-10 


50 


95 


SP2-3 


3 


4 



10 The genealogy of these isolates is presented graphically in the "family tree" of Fig. 

3. ATCC No. , identified as Prototheca motif ormis EMS 13-4 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 

15 for the Purpose of Patent Procedure. ATCC No. , identified as Prototheca 

moriformis UV127-10 (unicellular green microalga), was deposited on May 25, 1999, 
with the American Type Culture Collection (ATCC), 10801 University Boulevard, 
Manassas, VA 201 10, USA, under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. 

20 ATCC No. , identified as Prototheca moriformis SP2-3 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 
for the Purpose of Patent Procedure. 



25 Example 4 

The following example shows that both growing and resting cells of Prototheca 
can rapidly convert L~galactose and L-galactono-y -lactone to AA, and that conversion 
of D-mannose to AA by Prototheca is more rapid than conversion of D-glucose. 

Shake flask cultures of the mutant strain UV77-247 were grown to glucose 
30 depletion in standard liquid medium (Table 3). Cells were washed twice and resuspended 
in complete medium with the glucose substituted by one of the compounds listed below. 
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Cell suspensions were incubated for 24 hours at 35° C with shaking, and the entire 
suspension was extracted with TCA as above and assayed for AA. 

Tables 6-8 show that both growing and resting cells of strain UV77-247 can 
rapidly convert L-galactose and L-galactono-Y-lactone to AA. In these experiments, 
D-fructose and D-galactose were converted to AA at the same rate as D-glucose, 
suggesting that they are metabolized to AA through the same route as D-glucose. None 
of the organic acids suggested in the literature to be intermediates in the biosynthesis of 
AA were converted to AA, including sorbosone, which has been proposed as an 
intermediate by Saito et a/.(1990 Plant Physiol. 94:1496-1500). 



10 



15 



20 



25 



TABLE 6 

Conversion of Compounds by Resting Cells of Strain UV77-247 



30 



Substrate (50 mM) 


Total AA, mg/L 


AA Relative to No 
Substrate Control 


L-galactose 


965 


623 


L-galactono-y-lactone 


818 


476 


D-fructose 


590 


248 


D-glucosone 


589 


247 


D-glucose 


584 


242 


D-galactose 


542 


200 


D-glucose (10 mM) 


388 


46 


D-gluconolactone 


382 


40 


D-gulono-y-iactone 


366 


24 


D-glucuronate 


364 


22 


L-sorbosone 


342 


0 


None 


342 


0 


2-keto-D-gluconic acid 


341 


-1 


D-tsoascorbic acid (10 mM) 


330 


-12 


D-glucuronolactone 


329 


-13 


D-gluconic acid 


309 


-33 


D-galacturonic acid 


297 


-45 


L-idonate 


296 


-46 



35 



Since strain UV77-247 converted L-galactose and L-galactono-Y-lactone to AA 
much more rapidly than it did glucose, it suggests that these compounds are intermediates 
in the AA biosynthetic pathway and that they are "downstream" from glucose. 

The data in Tables 7 and 8 also show that growing and resting cells of UV77-247 
consistently convert D-mannose to AA at a rate greater than that of glucose. 
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TABLE 7 



Conversion of Compounds t AA by Resting Cells of Strain UV77-247 



Compound 


Ascorbic Acid, mg/L 


25.5 h 


30 h 


47 h 


L-galactose 


667 


718 


620 


L-galactono-y-lactone 


644 


681 


749 


D-glucosone 


465 


462 


354 


D-mannose 


448 


462 


399 


D-fractose 


402 


408 


367 


d-glucose 


395 


404 


351 


D-galactose 


352 


361 


337 


none 


287 


288 


258 



15 



20 



25 



30 



TABLE 8 

Conversion of Compounds to AA by Growing Cells of Strain UV77-247 





Ascorbic Acid, mg/L 




AA/A^, 


Compound 


25.5 h 


44h 


L-galactose 


249 


506 


4.5 


112 


D-mannose 


228 


488 


5.6 


87 


L-galactono-y-lactone 


214 


342 


5.0 


68 


D-glucose 


178 


396 


5.9 


67 


— Min,fi]iirtnrn^^^a 










D-glucosone 


176 


362 


5.7 


64 


D-galactose 


185 


380 


5.9 


64 


none 


182 


249 


4.4 


57 


D-gluconic acid (K) 


: 178 


262 


5.0 


52 


L-idonate (Na) 


» 182 


232 


4;7 


49 


2-keto-D-gluconic acid 


182 


255 


5.3 


48 


2-deoxy-D-glucose 


181 


227 


4.8 


47 


D-glucuronic acid lactone 


165 


218 


5.0 


44 


D-giucuronic acid (Na) 


173 


241 


5.6 


43 


L-gulono-y-lactone 


152 


195 


5.0 


39 


L-sorbosone 


178 


160 


4.7 


34 


D-glucono-6-lactone 


130 


190 


5.7 


33 


D-galacturonic acid 


130 


180 


6.0 


30 



35 



These cells converted L-galactose, L-galactono-y-lactone and D-mannose to AA 
more rapidly than they did glucose, suggesting that mannose exerts its effect in the 
biosynthetic pathway "downstream" from glucose. 
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Example 5 

Using the methods described above, a collection of mutants was assembled. The 
specific AA formation for representative mutants are shown in Table 5. The genealogy 
of these isolates is presented graphically in the "family tree" of Fig. 3. 

These isolates were tested for their ability to convert compounds which could be 
converted to AA by strain UV77-247. Testing was done as in Example 4. Results are 
shown in Table 9. 

TABLE 9 

Conversion of Compounds to AA by Resting Cells 
of Mutant Strains of Prototheca of Varying Abilities to Synthesize AA 



Strain 


Absolute AA, mg/L 


Buffer 


Glucose 


L-galactose 


L-gal-y-lacL 


Mannose 


Fructose 


EMS13-4 


53 


97 


191 


173 


139 


ND 


UV1 27-10 


45 


140 


213 


140 


128 


143 


SP2-3 


19 


19 


204 


146 


24 


27 


NA21-14 


61 


80 


147 


158 


118 


115 


UV82-21 


15 


16 


183 


175 


18 


17 


UV213-1 


16 


15 


170 


135 


17 


16 


UV218-1 


16 


18 


136 


176 


19 


21 


UV244-1 


16 


16 


164 


162 


16 


16 


UV244-15 


26 


77 


30 


21 


94 


89 


LA/244-16 


28 


64 


53 53 


53 


66 



15 



20 



ND = Not Determined 

These data suggest that the mutational blocks in those strains which convert 
fructose and mannose to AA poorly are before ("upstream" from) L-galactose and 
25 L-galactono-y-lactone in the pathway. 



30 



Example 6 

The following example shows that magnesium inhibits eariy steps in the production 

of AA 

To address the question of whether magnesium actually inhibits AA synthesis, 
strain NA45-3 (ATCC 209681) was grown in magnesium (Mg)-limited and Mg-sufficient 
medium. ATCC No. 209681, identified as Prototheca moriformis NA45-3 (Source: 
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repeated mutagenesis of ATCC No. 75669; Eucaryotic alga. Division Chlorophyta, Class 
Chlorophyceae, Order Chlorococcales), was deposited on March 13, 1998, with the 
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 
201 10, USA, under the terms of the Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for the Purpose of Patent Procedure. Cells from both 
cultures were harvested and resuspended in the cell-free supernate from the Mg-limited 
culture, and to half of each cell suspension additional magnesium was added in order to 
bring the level in the suspension to the Mg-sufficient level. The four conditions under 
which assays were run were as follows. 



10 



TABLE 10 

Conditions Used to Test the Effect of Magnesium on AA Production 



15 



20 



25 



Condition 


Magnesium concentration, g/L, during: 


Growth 


Assay 


1Mg>1Mg 


0.02 


0.02 


1Mg>10Mg 


0.02 


02 


10Mg>1Mg 


02 


0.02 


10Mg>10Mg 


02 


0.2 



30 



'Suos^ 

D-glucosone, D-fructose. D-galactose, D-mannose, and L-galactono-y-lactone, were 
added at 20 g/L to the four cell suspensions. Accumulation of AA after 24 hours was 
measured and compared to a control in which no substrate was added. The results of this 
study are shown graphically in Fig. 4. 

When cells growing under magnesium-limited conditions were incubated with 
substrates in low-magnesium broth (lMg>lMg condition), all showed significant and 
similar accumulation of AA over the control condition. When the same cells were 
incubated in high magnesium broth (lMg>10Mg condition), the accumulation of AA was 
reduced about 40% for all substrates except D-mannose and L-galactono-y -lactone, 
suggesting that 1) the rate-limiting step in the conversion of D-glucose, D-glucosone, 
D-fructose, and D-galactose to AA is inhibited by magnesium or 2) magnesium stimulates 
an enzyme which results in the conversion of these compounds to some other 
compound(s), reducing the amount of substrate available for AA synthesis. On the other 
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hand, conversion of D-mannose and L-galactono-y-lactone appeared to be unaffected by 
the presence of magnesium in the resuspension buffer, indicating that either 1) 
magnesium-inhibited enzymes are not involved in the conversion of these substrates to AA 
or 2) D-mannose and L-galactono-y-lactone enter the pathway far enough downstream 
5 from the point where they can be siphoned off by side reactions involving Mg-requiring 
enzymes. 

When cells were grown under magnesium-sufficient conditions, very little AA 
accumulation from any of the D-sugars was observed, regardless of the level of 
magnesium in the resuspension broth. Accumulation of AA from L-galactono-y -lactone, 

10 however, was enhanced over that observed when cells are grown in Mg-limited 
conditions. This suggests that enzymes early in the pathway are repressed under 
Mg-sufficient conditions. Thus, the D-substrates all behaved similarly, with the exception 
of the apparent lack of magnesium inhibition of D-mannose conversion to AA This 
would suggest that D-mannose enters the AA biosynthetic pathway at a point other than 

1 5 the other D-sugars. 

Figs. 2A and 2B represent some of the fates of glucose in plants. The first 
enzymatic step in this scheme which commits carbon to glycolysis is the conversion of 
fructose-6-P to fructose- 1,6-diP by phosphofructokinase (PFK). This reaction is 
essentially irreversible, and leads to the well known TCA cycle and oxidative 

20 phosphorylation, with concomitant ATP and NADH/NADPH generation. PFK has an 
absolute requirement for magnesium. If magnesium is limiting, this reaction could slow 
and eventually stop, blocking the flow of carbon through glycolysis and beyond, and 
would result in cessation of cell division even in the presence of excess glucose. One 
would expect fructose-6-P to accumulate under these conditions, fueling AA synthesis by 

25 the pathway shown in Figs. 1 and 2. 

Example 7 

The following example shows the correlation in Prototheca between AA 
production and the activity levels of the enzymes in the AA pathway. 
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Phosphomannose isomerase fPlAT) Assay 

PMI activity was first assayed (See Fig. 1). Ten strains representing a range of 
AA productivities were grown according to the standard protocol to measure 
AA-synthesizing ability. Cells were harvested 96 hours into magnesium-limited 
5 incubation, washed and resuspended in buffer containing 50 mM Tris/10 mM MgCl^ pH 
7.5. The suspended cells were broken in a French press, spun at 30,000 x g for 30 
minutes, and desalted through Sephadex G-25 (Pharmacia PD- 10 columns). Reactions 
were carried out in the reverse direction by adding various volumes of extracts to 
solutions of Tris/Mg buffer containing 0.15 U phosphoglucose isomerase (EC 5.3. L9), 
10 0.5 U glucose-6-phosphate dehydrogenase (EC 1.1.1 .49), and 1 .0 mM NADP. Reactions 
were initiated by addition of 3 mM (final) mannose-6-phosphate. Final reaction volume 
was 1 .0 mL. All components were dissojved in Tris/Mg buffer. Activities were taken as 
the change in A^min. From these activities was subtracted the activities measured in 
identical reaction mixtures lacking the M-6-P substrate. Specific activities were calculated 
15 by normalizing the activities for protein concentration in the reactions. Protein in the 
original extracts was determined by the method of Bradford, using a kit from Bio-Rad 
Laboratories (Hercules, CA). All enzymes and nucleotides were purchased from Sigma 
t@h^ffi^aiie8W(fStPB5uispM©))] 



Phosphomannomutase (PMM) Assay : 

20 Phosphomannomutase was measured in a similar manner in the same strains, but 

"If 

these assay reaction mixtures also contained 0.25 mM glucose- 1,6-diphosphate, 0.5 U 
commercially available PMI, and the reactions were started with the addition of 3.0 mM 
(final) mannose-1 -phosphate rather than mannose-6-phosphate. 

Phosphofructokinase (PFIQ Assay 
25 To shed light on the possibility that the enhancement of AA concentration in 

cultures which were limited for magnesium was due to a diversion of carbon from normal 
metabolism by a reduced activity of the first committed step in glycolysis (PFK) the strains 
were also assayed to confirm the presence of this enzyme activity. Cells were cultured, 
washed and broken as above. Extracts were centrifuged at 100,000 x g for 90 min before 
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desalting. Reactions were carried out in the forward direction by adding various volumes 
of extracts to solutions of Tris/Mg buffer containing 1.5 mM dithiothreitol, 0.86 U 
aldolase (EC 4.1.2.13), 1.4 U a-glycerophosphate dehydrogenase (EC 1.1.1.8), 14 U 
triosephosphate isomerase (EC 5 .3 . 1 . 1 ), 0. 1 1 mM NADH, and 1 .0 mM ATP. Reactions 
5 were initiated by addition of 5 mM (final) fructose-6-phosphate. Final reaction volume 
was 1.0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as 
the change in A^min. From these activities were subtracted the activities measured in 
identical reaction mixtures lacking the F-6-P substrate. Specific activities were calculated 
by normalizing the activities for protein concentration in the reaction. Protein in the 
10 original extracts was determined as above. 

GPP-P-mannose pyrophosphorylasc (GMP) Assay 

These same mutant strains were assayed for the next enzyme in the proposed 
pathway, GMP. Strains were grown both according to the standard Mg-Iimiting protocol 
(harvested 43-48 hours into magnesium-limited incubation) and in standard Mg-sufficient 

15 medium (harvesting all cells before glucose depletion). Washed cell pellets were 
resuspended in 50 mM phosphate buffer, pH 7.0, containing 20% (v/v) glycerol and 0.1 
M sodium chloride (3 mL buffer/g wet cells), and broken in a French press. Crude extracts 
were spun at 15,000 x g for 15 minutes. Reactions were carried out in the forward 
direction by adding various volumes of extracts to solutions of 50 mM phosphate/4 mM 

20 MgCl 2 buffer, pH 7.0, containing 1 mM GTP. Reactions were initiated by addition of 1 
mM (final) mannose-l-phosphate. Final reaction volume was 0.1 mL. Reaction mixtures 
were incubated at 30 C for 10 min, filtered through a 0.45 \im PVDF syringe filter, and 
analyzed for GDP-mannose by HPLC. A Supelcosil SAX1 column (4.6 x 250 mm) was 
used with a solvent gradient (1 mL/min) of: A - 6 mM potassium phosphate, pH 3.6; B 

25 - 500 mM potassium phosphate, pH 4.5. The gradient was: 0-3 min, 100% A; 3-10 min, 
79% A; 10-15 min, 29% A. Column temperature was 30 C. Two assays that showed 
enzyme activity proportional to the amount of protein were averaged. Control 
no-substrate and no-extract reactions were also run. Specific activity was calculated by 
normalizing the activity for protein concentration in the reaction. Protein in the original 

30 extracts was determined as above. 
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GDP-D-mannoseODP-T galactose Enimerase Assay 

Further tests measured the activities of the next enzyme in the proposed pathway, 
GDP-D-mannose:GDP-L-galactose epimerase. Strains were grown according to the 
standard protocol, harvested 43-48 hours into magnesium-limited incubation, washed, and 
5 resuspended in buffer containing 50 mM MOPS/5 mM EDTA, pH 7.2. Washed pellets 
were broken in a French press, and spun at 20,000 x g for 20 min. Protein determinations 
were made as above and a dilution series of each was made, ranging from 0.4 to 2.2 mg 
protein/mL. 50 \xh aliquots of these dilutions were added to 10 \iL aliquots of 6.3 mM 
GDP-D-mannose in which a portion of this substrate was universally labeled with 14 C in 

10 the mannose moiety. This substrate had an activity of 16 jiCi/niL before dilution into the 
reaction mixture. Reactions were stopped after 10 min by transferring 20 nL of the 
mixture into microfuge tubes containing 20 \iL of 250 mM trifluoroacetic acid (TFA) 
containing 1.0 g/L each D-mannose and L-galactose. These tubes were sealed and boiled 
for 10 min, cooled, spun for 60 sec in a Beckman Microfuge E, and 5 jiL of each 

15 hydrolysate was spotted on 20 x 20 cm plastic-backed EM Science Silica gel 60 thin-layer 
chromatography plates (#5748/7), with 1 cm lanes created by scoring with a blunt stylus. 
After drying, plates were twice chromatographed for 2.5 hours in ethyl 

sugars were visualized by spraying dried plates with 0.5% p-anisaldehyde in a 62% 
20 ethanolic solution of 0.89 M sulfuric acid and 0. 1 7 mM glacial acetic acid, and heating at 
105 C for about 15 min. Spots of L-galactose and D-mannose were cut from the plates 
and counted in a scintillation counter (Beckman model 2800). For time-zero control 
counts, 16.7 nL of each extract dilution was added to 23.3 \xL of the labeled substrate 
above, which had been diluted 1:7 with the TFA/mannose/galactose solution. 
25 Table 1 1 summarizes the results of the five enzyme assays for the strains tested, 

along with their specific AA formations. 
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TABLE 11 

Specific Enzyme Activities (mU)* of Selected Mutant Prototheca Strains 



Strain 


AA Specific 
Form, mg/g 


mW±m\ mm 

PMI 


PMM 


PFK 


GMP 


Epimerase 


Mg- 
limited 


Mg~ 

sufficient 


UV164-6 


78.4 












0.79 


EMS13-4 


73.7 


10.8 


69.6 


13.5 


2.6 


6.8 


0.78 


UV140-1 


69.0 












0.78 


NA45-3 


61.4 












0.58 


UV77-247 


44.4 












0.52 


UV1 27-10 


40.1 


11.1 


45.8 


24.4 
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Unite: PMI and PMM, nmoles NADP reduced per min/mg protein; PFK, n moles NADH oxidized per 
min/mg protein; GMP, nmoles GDP-D-mannose formed per min/mg protein; epimerase, nmoles 
GDP-L-galactose formed per min/mg protein. 

The only enzyme which showed a strong correlation between activity and the 
ability to synthesize AA was the GDP-D-mannose:GDP-L-galactose epimerase. This 
correlation is depicted in Fig. 5. All of the strains which produced measurable amounts 
of AA had measurable amounts of epimerase activity. The converse was not true: four 
of the strains which synthesize little or no AA had significant epimerase activities. These 
strains are candidates for having mutations which affect enzymatic steps downstream from 
the epimerase. Since all of the strains tested can synthesize AA from L-galactose and 
L-galactono-y-lactone (see Examples 4 and 5), the genetic lesion(s) in these four mutants 
must lie between GDP-L-galactose and free L-galactose. 

Example 8 

The next example shows the relationship between GDP-D-mannose: GDP-L- 
galactose epimerase activity and the degree of magnesium limitation in two strains, the 
original unmutagenized parent strain ATCC 75669, and one of the best AA producers, 
EMS13-4 (ATCC ). 
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Four flasks of each strain were grown according to the standard protocol. One 
culture of each was harvested 24 hours into magnesium-limited incubation, and every 24 
hours thereafter for a total of four days. O.ie flask of each strain was also harvested 24 
hours into magnesium sufficient incubation. All cultures had glucose remaining when 
5 harvested. Fig. 6 shows graphically the AA productivity and epimerase activity in 
EMS 13-4 and ATCC 75669 as the cultures became Mg-limited. Epimerase activity in 
EMS 13-4 was significantly greater than that in ATCC 75669 at all time points. There was 
also a concurrent rapid rise in both AA productivity and epimerase activity in EMS 13-4 
as the cultures became increasingly Mg-limited. While there was a moderate increase in 
10 AA productivity in ATCC 75669 as Mg became more limiting, there was no effect on 
epimerase activity. 

Examck_2 

The following example shows the results of epimerase assays performed with 
extracts of two K coli strains into which were cloned the E. coli gene for GDP-4-keto-6- 
15 deoxy-D-mannose epimerase/reductase. 

The E. coli K12 wca gene cluster is responsible for cholanic acid production; 
wcaG encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. 

■■^■■■BtfiiS f Hfi i sfiftB nriWf rTjirnrrTTOnrrrlTTriiTin 

amplified by PCR from E. coli W31 10 genomic DNA using primers WG EcoRI 5 (5* 
20 TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG 3'; SEQ ID NO: 12) and WG 
Xhol 3 (5* AACTCGAGTTACCCCCAAAGCGGTCTTGATTC 3 1 ; SEQ ID NO: 13). 
The 973-bp PCR product was ligated into the vector pPCR-Script SK(+) (Stratagene, 
LaJolla, CA). The 973-bp ExoRII/XhoI fragment was moved from this plasmid into the 
ExoRII/XhoI sites of pGEX-5X-l (Amersham Pharmacia Biotech, Piscataway, NJ), 
25 creating plasmid pSW67-l . Plasmid pGEX-5X-l is a GST gene fusion vector which adds 
a 26-kDa GST moiety onto the N-terminal end of the protein of interest. K coli 
BL21(DE3) was transformed with pSW67-l and pGEX-5X-l, resulting in strains 
BL21(DE3)/pSW67-l and BL21(DE3)/pGEX-5X-l. 

The E coli wcaG sequence (nucleotides 1 through 966 of SEQ ID NO:3) was also 
30 amplified by PCR from £. coli W3 1 10 genomic DNA using primers WG EcoRI 5-2 (5* 
CTGGAGTCGAATTCATGAGTAAACAACGAG 3'; SEQ ID NO: 14) and WG PstI 3 
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(5* AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC 3'; SEQ ID NO:15). The 976- 
bp PCR product was ligated into a pPCR-Script (Stratagene). The 976-bp ExoRH/Pstl 
fragment was moved from this plasmid into the ExoRH/Pstl sites of expression vector 
pKK223-3 (Amersham Pharmacia Biotech), creating plasmid pSW75-2. E. coli JM105 
5 was transformed with pKK223-3 and pSW75-2, resulting in strains JM1 05/pKK223-3 and 
JM105/pSW75-2. 

All six strains were grown in duplicate at 37°C with shaking in 2X YTA medium 
until an optical density of 0.8-1.0 at 600 nm was reached (about three hours). 2X YTA 
contains 16 g/L tryptone, 10 g/L yeast extract, 5 g/L sodium chloride and 100 mg/L 

10 ampicillin. One of each culture was induced by adding isopropyl P-D- 
thiogalactopyranoside (IPTG) to 1 mM final concentration. All 12 cultures were 
incubated for an additional four hours, washed in 0.9% NaCl, and the cells were frozen 
at -80°C. Prior to pelleting the cells for preparation of extracts, a portion of each culture 
was used for a plasmid DNA miniprep to confirm the presence of the appropriate plasmids 

15 in these strains. A protein preparation of each culture was also run on SDS gels to 
confirm expression of a protein of the appropriate size where expected. Frozen pellets 
were thawed, resuspended in 2.5 mL MOPS/EDTA buffer, pH 7.2, broken in a French 
Press (10,000 psi), spun for 20 min at 20,000 x g, assayed for protein as above and diluted 
to 0.01, 0.1, 1.0 and 3 mg/mL protein. 

20 Induction of the strain BL2 1 (DE3)/pGEX-5X- 1 resulted in high-level expression 

of a 26-kDa protein indicating the synthesis of the native GST protein. Induction of strain 
BL21(DE3)/pSW67-l resulted in high-level expression of a 624cDa protein, indicating the 
synthesis of the native GST protein (26K) fused to the wcaG gene product (36K). An 
aliquot of the fusion protein was treated with the protease Factor Xa (New England 

25 Biolabs, Beverly, MA), which cleaves near the GSVwcaG junction. Induction of the 
strain JM105/pSW75-2 resulted in high level expression of a 36-kDa protein, indicating 
the synthesis of the wcaG gene product. No such protein was detected in 
JM105/pKK223-3 (vector only). 

Next, it was of interest to test extracts in the standard epimerase assay described 

30 in Example 7 to determine if any of the extracts containing the wcaG product could bring 
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about the conversion of GDP-D-mannose to GDP-L-galactose. The extracts to be 
assayed are: 

BL21Q2E2) Group 

1 . BL2 1 (DE3) uninduced 

5 2. BL2 1 (DE3) induced with ImM IPTG 

3. BL21(DE3)/pGEX-5X-l uninduced 

4. BL21(DE3)/pGEX-5X-l induced with ImM IPTG 

5. BL21(DE3)/pSW67-l uninduced 

6. BL21(pE3ypSW67-l induced with 1 mM IPTG; fusion protein intact 

10 7. BL21(DE3)/pSW67-l induced with 1 mM IPTG; GST moiety cleaved 

JM1Q5 Group 

1 . JM105 uninduced 

2. JM105 induced with ImM IPTG 

3. JM105/pKK223-3 uninduced 

15 4. JM105/pKK223-3 induced with 1 mM IPTG 

5. JM105/pSW75-2 uninduced 

6. JM105/pSW75-2 induced with 1 mM IPTG 

Extracts 1 and 7 from the BL21(DE3) group and extracts 1 and 6 from the JM105 
group were tested for GDP-D-mannose.GDP-L-galactose epimerase-like activity in a pilot 
20 experiment. In this initial experiment, no epimerase activity was detected in any of the 

it is possible that the wcaG gene product is incapable of catalyzing the conversion of 
GDP-D-mannose to GDP-L-galactose, although this conclusion can not be reached until 
several other parameters are tested. Second, it is possible that under the assay conditions 

25 which are satisfactory to measure activity for the endogenous GDP-D-mannose:GDP-L- 
galactose epimerase, the wcaG gene product does not have GDP-D-mannose:GDP-L- 
galactose epimerase-like activity. Therefore, alternate conditions should be tested. 
Additionally, confirmation experiments should be performed to confirm the accuracy of 
the pilot conditions. Third, although the BL21(DE3) and the JM105 clones produce 

30 proteins of the expected size, the constructs have not been sequenced to confirm the 
proper coding sequence for the wcaG gene product and thereby rule out PGR or cloning 
errors which may render the wcaG gene product inactive. Fourth, the protein formed 
from the cloned sequence is full-length, but inactive, for example, due to incorrect tertiary 
structure (folding). Fifth, the gene is overexpressed, resulting in accumulation of insoluble 

35 and inactive protein products (inclusion bodies). Future experiments will attempt to 
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determine whether the constructs have or can be induced to have the ability to catalyze 
the conversion of GDP-D-mannose to GDP-L-galactose, and to use the sequences to 
isolate the endogenous GDP-D-mannose:GDP-L-galactose epimerase. 

Table 12 provides the atomic coordinates for Brookhaven Protein Data Bank 
Accession Code lbws: 

TABLE 12 
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SEORES 9 A 321 LEU GLY SER SER CYS ILE TYR PRO LYS LEU 



SEORES 10 A 321 PRO MET ALA GLU SER GLU LEU LEU GLN GLY THR LEU GLU 

SEPRES 11 A 321 PRP THR ASN gLV |>RP TYR ALA ILE ALA LYE ILF &L& gLy 

SEORES 12 A 321 ILE L YS LEU CYS jp? SER TYR ASN ARG GLN TYR GLY ARG 

SEORES 13 A 321 ASP TYR ARG SER VAL MET PRO THR ASN LEU TYR GLY PRO 

SEORES 14 A 321 HIS ASP ASN PHE HIS PRO SER ASN SER HIS VAL ILE PRO 

SEORES 15 A 321 ALA LEU LEU AR G ARG PHE HIS GLU ALA THR ALA GLN ASN 

SEORES If A 321 ALA PRQ A^P VAL VAL VAL tr.p gLY SER EftY THR .PRP MET 

35 SEORES 17 A 321 ARG GLU PHE LEU HIS VAL ASP ASP MET ALA ALA ALA SER 

SEORES 18 A 321 ILE HIS VAL MET GLU LEU ALA HIS GLU VAL TRP LEU GLU 

SEORES 19 A 321 A5N THR PLN ppp MET LEV SER HIS ILE ASN VAL GLY THR 

SEORES 20 A 321 GLY VAL ASP CYS THR ILE ARG ASP VAL ALA GLN THR ILE 

SEORES 21 A 321 ALA LY S VAL VAL G LY TYR LYS GLY ARG VAL VAL PHE ASP 

40 SEORES 22 A 321 ALA SER LYS PRO ASP G LY THR PRO ARG LYS LEU LEU ASP 

SEORES 23 A 321 VAL THR ARG LEU HIS GLN LEU GLY TRP TYR HIS GLU ILE 

sfiQRfiS 24 A 321 sgR LEV ,<?LU axa ELY l*pv ATA spR THR TYR PLN TRP pfflS 

SEORES 25 A 321 LEU GLU ASN GLN ASP ARG PHE ARG GLY 

HET NDP 10 



45 HETNAM NDP NICOTINAMIDE ADENINE DINUCLEOTIDE PHOSPHATE 
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SHEET 


4 A 6 LYS A 101 LEU A 105 1 


N LYS A 101 


0 VAL A 59 








SHEET 


5 A 6 ASP A 157 PRO A 163 1 


N ASP A 157 


0 LEU A 102 








SHEET 


6 A 9 ILE A 243 YAL A 245 1 


N ILE A 243 


0 MET A 192 . 








SHEET 


1 B 2 ASN A 165 TYR A 167 0 














SHEET 


2 B 2 PHE A 211 HIS A 213 1 


N LEU A 212 


0 ASK A 165 






25 


SHEET. 


1 C 2 ASP A 198 TRP A 202 0 














SHEET 


2 C 2 ARG A 269 ASP A 273 1 


■J? _ARG^A 269 


Q VAL A 199 








SITE . . 


1 CAT 1 TYR 


136 














SITE . 


2 CAT 1 LYS 


149 














SITE 


3 CAT 1 SER 


107 












30 




104.200 104.200 75.880 90.00 


9Q.0Q 


120.00 P 


• 32 n 9 








9FI9X1 


1.000000 


0.000000 0.000000 




0.00000 










QRICX2 


0.PPQ90Q 


1.000000 0.000000 




9,99999 










9RIGX3 


9.999999 


0.000000 1.000000 


0.00Q99 












0.009597 


9,995541 9,999999. 




0.00000 








35 


££ALE2_ 


0.000000 


9.911991 9.9990OQ 




9.OQQQ0 










SCALE? 


0.000000 


Q.000000 0,013179 




0.00000 










HETATM 


1 Q HQH 


1 55.C52 - 


16.806 


22,535 


1,99 9.73 




.0 




HBTATM 


2 o HOfl 


3 59r494 - 


10 T 63? 


19,749 


1,90 13,17 




9 




HETATM 


3 O HOH 


4 58.230 - 


11.715 


27.770 


1.00 19.07 




0 


40 


HETATM 


4 0 nop 


5 57.25? 


-3,759 


30.107 


ItPO 11.21 




9 




HETATM 


5 0 HOH 


6 58.298 - 


10.011 


25,527 


1x99 15,74 




.9 




HETATM 


<? 0 HQH 


7 49.321 


6.583 


38.815 


1.00 19.33 




.9 




HETATM 


7 „P HQH 


9 53,795 


H.2« 


22.464 


1.00 10.94 




Q 




HETATM 


8 0 HOH 


19 74^52.. 


2r999 


9.141 


1.00 17.80 




J? 


45 


HETATM 


9 p HOH 


11 4?. 763- 


_ 9,926 


32.999 


1.00 22.02 




g 



SDOCID: <WO 996461 BA1 I > 



WO 99/64618 



PCT/US99/I1576 



82 



10 Q HSH. 



12 



55.530 -11.162 28.526 



HE T ATM 11 Q 



11 



75.027 7.034 27.353 



HETATM 12 0 HSH. 



1± 



49,99* -2,314 11,032 



HETATM 13 P HQH. 



15 



61,323 29.657 



HETATM 14 Q HQH 16 



61,029 -Ut?^ 29.131 



HETATM 1? Q HSIL 



XL 



1SL3M 5,991 10,130 



HETATM 16 O HOH 



11 



64.506 -6. 302 32.989 



HETATM 17 Q , HQH 



11 



57.956 -16.399 25.Q95 



HETATM .19 Q HOH 



21 



39.97? ?6,$36 19.Q70 



10 



HETATM 19 O HQH. 



21. 



38.042 33. 487 21.909 



HETATM 20 Q HQH. 



21 



39,172 35,775 20.927 



HETATM 21 Q HQH. 



21 



HL21& -11.129 15.241 



H3TATM 22 9 TOP 



26 



54.205 19,36P 29,396 



HETATM 23 Q HQH. 



27 



5P.436 2,554 16,793 



15 



HETATM . 24 Q HQfl 



28 



69.692 19.108 38.979 



HETATM 25 Q HQH 



21 



56,432 -8,977 19,303 



HETATM 25 0 HQH 



gQ. 932 3.415 42.349 



HETATM 27 



-HQEL 



21. 



53,88? ~12 T 7Q6 29.764 



HEIATH 29 Q HQH 



22. 



37.997 26.373 29.059 



20 



HETATM 29 O HOH 



21 



49.201 11. 173 26.867 



HETATM 30 O HOH 



21 



46.762 -0.278 31.394 



HETATM 31 O HOH 



21 



41.731 27.568 43.302 



HETATM 32 Q HQH 



21 



65.927 11.202 29,929 
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'1 "1? -T 0.6W43 W 8i68. 
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72.27^ -9.099 33.407 



HETATM 37 Q HQH 
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42.685 34. 461 33.955 
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$3,490 13,394 39,364 
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11 
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11 
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11 
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11 
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40 
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11 
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12. 
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HETATM 52 O HOH 



61 
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HETATM 53 O HOH 



31 
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HETATM 55 Q HQH §2 42.746 25.153 23.465 1.00 ?7.0S Q 

HBiam 5 6 0 HQH fifi 53.638 -16,457 32.292 1.00 31.71 Q 

HE T ATM 57 0 HQH £2 33.390 41.716 31.408 1.00 ?q.Q2 Q 

HETATM 58 Q HQH 22 57.768 17.897 42.434 1.00 25.75 Q 

5 HE T ATM 59 0 HQH 21 lS^Ml 9.164 11.766 1.00 3S.13 Q 

HETATM 60 O HQH 72 62.032 33.292 44.749 1.00 4fi.T8 Q 

HETATM 61 0 HQH 7? 47.310 14.312 34.285 l.QQ 31.18 Q 

HfilATU 62 Q HQH 24 79.660 -3.947 15.933 1.00 34.63 Q 

HETATM 63 0 HQH 75 4 6.929 5.343 4.5 50 l.QQ 23.14 £ 

10 HETATM 64 Q HQH £6 73,175 12,039 28.412 1.00 27.26 Q 

HETATM 6? O HQH 22 46.297 -6.982 30.032 1.00 43.41 Q 

HETATM 66 O HQH Ifi 68.528 -3.422 40.86 9 l.QQ 38.47 Q 

H5TATM 67 Q HQH 22 62.080 -1.448 42 .B0 3 1.00 24.60 Q 

HETATM 68 0 HQH 80 65.330 18.150 40.726 l.QQ 41.00 Q 

15 HETATM 69 O HQH 81 51.775 16.128 37.607 ^ .00 25.13 Q 

HEIA2M 70 o HQH .93 54.266 28,682 43.313 1.00 27.61 Q 

HETATM 71 O HQH 83 73.291 -15.479 20.603 1.00 37.54 O 

HETATM 72 Q HQH M 34.760 21.479 28.544 1.00 43.87 Q 

HETATM 7? Q HQH U 37.326 34,131 2?. 677 l,QQ 24. V? Q 

20 HETATM 74 Q HQH 88 65.168 20.148 6.735 1.00 26.10 Q 

HETATM 75 O HQH 89 59.196 12.089 13.630 1.00 25.24 Q 

HETATM 76 0 HQH 21 66,576 -6,2?? 4P.27? l.QQ 43,11 Q 

HETATM 77 0 HQH 23 37.339 29.394 25.515 1.00 27.56 Q 

HETATM 78 O HQH 94 52.339 -17.014 42.271 1.00 48.96 Q 

25 HETATM 79 O HQH 95 4P.5U 32.?27 31,717 1.00 22,46 £ 

HETATM 80 O HQH 96 78.580 13.121 34.138 1.00 27.98 Q 

HETATM 81 O HQH 97 65.090 15.704 34.876 1.00 18.96 Q 

HETATM 82 O HQH 99 84.562 2.951 27.181 1.00 35.92 Q 

HETATM 83 O HQH 100 50.386 9.761 9.646 1.00 23.18 Q 

30 HETATM 84 O HQH 101 67.649 -0.851 38.764 l.QQ 24.?? Q 

HETATM 85 O HQH 102 44.001 4.?93 34,315 1 ■ 0Q 31,13 Q 

hetatm 86 o HQH LPJ 59.396, -5-071 26,211 l.QQ 2?.1Q Q 

HETATM 87 O HQH 104 77.364 4.745 41.506 1.00 35.32 Q 

HETATM 88 O HQH 105 59.034 21.201 32.414 1.00 23.43 Q 

35 HETATM 89 O HQH 106 42.463 34.698 14.327 1.00 38.86 Q 

HETATM 90 O HQH 107 70.217 14.292 20.864 1.00 42.39 Q 

HETATM 91 0 HQH lUfi 76..??? 9.130 25.962 l.QQ 32,93 Q 

HETATM 92 O HQH 109 49.766 29.937 22.173 1.00 42.52 Q 

HETATM 93 O HQH 110 72.473 13.536 38.823 1 , 0Q 33.32 Q 

40 HETATM 94 O HQH 111 6 4.328 -12.084 38.608 1.00 37.99 Q 

HETATM 95 O HQH 112 60.161 1 6.382 42.682 1.00 35.68 Q 

HETATM 96 O HQH 113 47.602 1 3.639 27.016 1.00 26.01 Q 

HETATM 97 O HQH 115 64.606 1 1.644 40.107 1,QQ ?Q f ?3 Q 

HETATM 98 O HQH 116 61.231 -15.137 27.255. l.QQ 38.76 Q 

45 HETATM 99 O HQH 117 65.324 -1 1 r 223 35.098 l.QQ 3Pt4.5 Q 
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qfTATM 100 Q HOH 1^ 



56.602 17.219 41.932 



HETATM 101 Q HQH 12 



RF.TATM 102 O HOH 1? 



HETATM 103 P HOH 123 



HETATM 104 Q HOH 



HETATM 19? P HOH 1? 



HEM .106 P HOH 126 



HETATM 1Q7 P HOH 12 



HETATM 109 P HQH 12 



10 



HETATM 109 O HOH 129 



rATM 110 AOS* NDP A 



HETATM 111 ACS* HDP A 



HETATM 112 AC4* NDP A 



HETATM 113 A04 + WPP A 



15 



HETATM 114 AC?* WPP A 



HETATM US A°3* WPP A 



HETATM 116 AC2* NDP A 



HETATM 117 AQ2* NDP A 



HETATM 118 AC1* NDP A 



20 



HETATM 119 Q3 NDP A 



HETATM 120 W° 5 * WPP A 



HETATM 121 NC5* NDP A 



HETATM 122 NC4* NDP A 



HETATM 123 NP4 + WPP A 



HETATM 126 MC2* WPP A 



HETATM 127 NQ2* NDP A 



HETATM 128 NCI* NDP A 



30 



HETATM.. 129 A P2* N P P A 



HETATM 130 AOP1 NDP A 



HETATM 131 APP2 WPP A 



HETATM 132 AOP3 WPP A 



HETATM 133 AP WPP A 



35 



HETATM 134. API WPP A 



HETATM 1 35 AO 2 NDP A 



HETATM 13 6 AN 9 NDP A 



HETATM 137 Ape W PP A 



HETATM 13 9 AN7 NDP A 



40 



HETATM 139 AC 5 NDP A 



HETATM 140 AC 6 NDP A 



HETATM 141 AN6 WPP A 



37.564 19.860 23.135 



64.845 5.057 21.132 



63,391 IS. 901 26,999 



42.567 6.134 32.635 



72.485 13.236 35.059 



65.229 3,650 44,032 



37.089 7.148 31.083 



73.327 10.546 12.123 



74.450 10.299 26.598 



$7,524 13.,P55 26,592 



69.099 12,297 25,614 



69.601 12. 124 25.858 



70.193 11,259 24.949 



70.494 13.390 25.973 



71-192 13,436 27,066 



71.373 13,220 24,626 



72.623 13.886 24.655 



71.510 11.702 24.656 



l.QQ 19.02 



65.336 13.590 26.129 



53 ,536 1 1 ,943 26 .449 



64.329 10.943 25,957 



63.467 9.646 25.686 



62.837 9.337 26.908 




61.15? 9,996 25.139 



60.881 7.662 24.715 



61.547 8.875 26.580 



73.104 15.069 23.823 



74 l ;sfoo 



^ 5,3 09.. 24,309 



72,797 14.925 22.348 



72.163 16.217 23.958 



66.660 14.257 26.393 



6$.99(i n t 7p$ 25.047 



66.439 15.207 27.521 



71.820 11.224 23.353 



71,1 04 » ,316 22.2Q Q 



71,758 10,835 21,161 



72,933 10,313 21,710 



74.053 9.657 21.140 



74. 16$ 9,464 19,919 



HETATM 142 AN1 NDP A 1_ 



75.078 9.280 21.942 



HETATM 143 AC 2 NDP A 



74.971 9.578 23.251 



45 HETATM 144 AN3 NDP A 1 



74.027 10. 302 23.889 
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.00 20,59 
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.oo 32.96 



.00 37, E4 
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.00 31.97 



.00 26.17 



l.QQ 15.31 



.00 34.39 



■ QO 13,63 



l.QQ 12.41 



,00 15,71 



■ oo 16,1-7 



.QQ ., 3 1. 35 



.00 12.59 



.00 17.56 



.00 15.44 



■QQ 24.92 



XL 



XL 
XL 



XL 



XL 
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HflTATM 145 AC4 NDP A 1 73.036 10.653 23.047 1.00 17,48 XX 

K^TATM 146 NP WPP A ■ 1 64.183 13,106 27 r l91 1.00 25.47 ft 

HCTATM 147 NOl NDP A 1 63.142 14,169 27.253 1.00 28.69 

^TATM 148 NQ2 NDP A 1 64.837 12.643 28.492 1.00 24.32 

5 HE T ATM 149 NNl NDP A 1 60.598 9.775 27.109 1.00 23.63 

RET ATM 150 NC2 NDP A 1 60.143 10.905 26.442-99.00 78.36 H 

HETATM 151 NC3 NDP A 1 59.070 11,648 27.007-99.00100.00 H 

HE T ATM 152 NC7 NDP A 1 58.497 13.017 26.528-99.00100.00 H 

HETATM 15? MQ2 NPP A 1 59.358 13.7 03 25.972-99.00100.00 H 

10 HETATM 154 NN7 NDP A 1 57.207 13.400 26.912-99.00 84.38 

HETATM 155 NC4 NDP A 1 58.442 11.146 28.137-99.00100.00 

HETATM 15 g ^cs Npp a 1 58-912 28t7£4-99-QP10P. QQ H 

HETATM 157 NC6 NDP A 1 59.951 9.266 28.14 7-99.00100.00 

AXOH 159 N US_ft 3 76,227 -5.632 44.315 1.00 61.49 H 

15 ATOM 15? gA LYS A 3 Z&OSJ -4»3Q2 43, ggj l r pg sg.pp £ 

him 160 C LYS A 3 75.985 -4.421 42.171 1.00 52.79 C 

AISM 161 P 1YS A 2 76.921 -4,737 41.119 l.QP 44. .7$ 2 

ATfiM U2 CP LYS A 3 77.359 -3.417 44.030 1.00 59.74 £ 

AXQM 163 CG LYS A 3 77.011 -1.944 44.3 14 1.00 50.87 £ 

20 AXQM. 154 CP LYS A 3 79,298 =UL£L 44.994 l.PP 61.21 £ 

AXCM 165 CE LYS A 3 77.855 - 0.377 46.1 86 1,00100.00 £ 

A12M 166 NZ LYS A 3 78.857 -0.401 47.343 1.00 70.63 22 

AICM 167 H £LH A 4 74.746 -4.242 41,747 1,PP 45, l ,li> H 

ATOM 1€8 C A GLN A. 4 74.4 0ft -4.326 40.347 1.00 37.18 C 

25 ATOM 169 C GLN A 4 74.983 -3.166 39.561 1.00 34.93 £ 

ATOM 170 O GLN A 4 75.127 -2.050 40.087 1.00 28.48 O 

ATOM 171 C P < ?M ? A 4 7 2,915 -4.445 4P,221 l.PP 34.$$ £ 

ATOM 172 CG GLN A 4 72.456 -5.854 40.584 1.00 31.82 £ 

ATOM 17 3 CD GLN A 4 72.570 -6.788 39.405 1.00 79.25 £ 

30 ATOM 174 .QE1 CLN A i 72.165 -6.452 39,296 l.P01PP,QP Q 

ATOM 175 NE2 GLN A 4 73.20 6 -7.925 39.623 1.00 80.24 H 

ATOM 176 N ARG A 5 75.475 -3.495 38.375 1.00 27.16 ft 

&XOI 177 CA ARG A 5 76.146 -2.546 37.483 1.00 39.16 £ 

ATOM 17 8 C ARG A 5 75.191 -2.01B 36.433 1.00 38.22 £ 

35 ATOM. 179 O ARG A 5 74.938 -2.698 35.438 1.00 32.44 Q 

AT CM 180 CB ARG A 5 7 7.398 -3.163 36.826 1.00 41.76 £ 

ATOM 181 CG ARG A 5 78.692 -2.954 37.663 1.00 37.34 £ 

ATOM 182 CD ARG A 5 80.015 -3.236 36.876 1.00 32.99 £ 

ATOM 183 NE ARG A 5 8 1.036 -2.203 37.125 -1.00 25.71 H 

40 ATOM 184 C2 ARG A 5 81.617 -1.488 36.169 1.00 32.53 £ 

ATOM 185 NH1 ARG A 5 81.293 -1.704 34,904 1.00 40,07 H 

ATOM 186 NH2 ARG A 5 8 2.516 -0.551 36.474 1.00100.00 U 

ATOM 187 N VAL A 6 74.743 -0.773 36.659 1.00 32.08 R 

ATOM 188 CA VAL A 6 73.715 -Q.082 35.881 1,00 28.89 £ 

45 Aim 189 C VAL A S 74.161 1.021 34.897 1,00 29.37 £ 
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ATOM. 



74,745 2.041 3$,274 



ATOM 191 CB VAL A 



72.577 0.378 36.613 



AXQtt 192 CGI VAL A 



AJJ31 



193 052 vaj, a 



ATOM 

ATOM 



194 N BHE A 



19? CA PHE A 



ATOM 



196 c PHE A 



ATOM 



197 O EH£Jl 



ATOM. 



199 CB PHE A 



10 



ATOM 

ATOM 



199 Qg PHE A, 



2QQ QPl PHE A 



ATOM 



201 CD2 PHE A 



ATOM 



202 CE1 PHE A 



ATOM 



2P? CE2 PHE A 



15 



ATW 204 CZ PHE A 



71.36fr 0.960 36.006 



72.108 -0.852 37.644 



73.948 0.749 33.615 



74,267 1.710 32.573 



72,975 2.423 32.192 



71,994 1,799 31.815 



74.864 1.004 31.374 



74.916 1.836 30.115 



75.521 3.087 30.108 



74.483 1.284 28.BB6 



75.614 3.828 28.902 



74.548 1.996 27.685 



75,129 3,255 27.673 



ATOM 



205 N ILE A 8 



72.959 3.727 32.454 



ATOM 



ZM . OA ILE A 8 



71.844 4.599 32.112 



A3 



207 Q ILE A 9 



72.337 5.351 30.909 



ATOM 



-2J1S. 



ILE A 8 



73.259 6.165 30.998 



20 



A3 



209 CB TLB A 8 



71.507 5.693 33.212 



ATOM 



210 CGI IL E A 8 



71.356 4.949 34.582 



ATOM 



21 1 c<?Z ILE A 9 



70 ,1 93 6- 3 .42 3 2.9.74 



ATOM 



212 CD1 ILE A 8 



71.Q91 5 t <>$l 35,707 



ATOM. 



213 -N ALA-A. 



71.896 4.906 29.752 



25 



CA ALA A 



72.256 5.559 28.513 




ATOM 



216 P ALA A. 



70. .411 7,032 29.045 



ATOM 



217 CB ALA A 



71.808 4.731 27.311 



ATOM. 



219 N CL Y A 19 



72.199 7.922 27.940 



30 



ATOM 219 CA GLY A 10 



71.706 9,284 27 t ?H 



ATOM 



220 -C GLY A 10 



71.407 9.819 29.305 



Arm. 



221 Q GLY A 10 



70.379 10.448 29.481 



ATOM. 



■222 H HI? A 11 



72.295 9.591 30.272 



ATOM 



223 CA HIS A 11 



72.06B 9.966 31.688 



35 



ATOM. 



224 C HIS A M 



7g. ope U.5Q4 3i.pi6 



AXSai 225 Q HIS A U. 



71,700 U.994 32,98? 



ATOM. 



226 CP HIS A .U 



73.153 9.350 32.581 



ATOM 



227 00 HIS A U 



74.5Q2 9.949 32.326 



ATOM 



228 ND1 HIS A 11 



75.239 9.649 31.197 



40 



ATOM. 



22? CP2 HIS A 11 



75.167 10.952 32.956 



ATOM 



230 CE1 HIS A 11 



76.317 10.407 31.170 



ATOM. 



231 NE2 HIS A 11 



76.271 11.240 32.197 



Aiai 232 N ARE A 12 



72,319 12r288 30,909 



ATOM. 



233 CA ARG A 12 



72.147 13.693 31.122 



-00 22.50 



>00 23.52 



.00 20.29 



-00 19.4? 



.00 22.92 



.00 27,15 



■OP 20,24 



.00 20.71 



.00 19.99 



■00 21.83 



.00 19.36 



.00 23.50 



.00 27.52 



x 90 19,33 . 



.00 19.59 



.00 19,75 



•QQ 14.25 



.00 11.22 



00 17,76 



.00 14.15 



■QQ 8-24 



.00 16.95. 



.00 10.32 



18-74 



00 22.39 



.QQ 14-43. 



.00 20.06 



.00 19.62 



-00 16.40 



.00 17.36 



-09 10.32 



-00 13.90 



-00 21.52 



.00 13,22 



-QQ 14. 



.00 23.7? 



00 24.90 



.09 16-35 



-00 22.54 



■00 17.56 



-09 22,31 



-00 18.90 



45 



ATOM. 



234 C ARG A 12 



70.851 14. 244 30.495 1.00 26.34 



Si 

Si 

-H 



c 



Ji 
-£ 
Ji 

S. 



SI 
Si 

Si 

jQ. 

Ji 
Si 
Si 
Si 
Si 
£ 
Ji 

Ji 

Ji 

Si 
c 
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AISM 235. 9 ARGA 12 70.572 15.426 30 .604 1.00 25.37 2 

AXCM 23g CP ARC? ft 12 73.352 14.418 30.587 l.QQ 2S . 93 £ 

AXSM 237 CG ARGA 12 74.582 13,943 31.2 79 1.00 53.87 £ 

AXCM 238 CD ARGA 12 75.757 14.619 30.699 1.00 32.53 £ 

5 23? ME ABS ft 12 76.359 15.576 31.605 1.00 69.90 H 

AISM 240 CZ ARG A 12 76.971 16.675 31.178 1. 00100.00 £ 

Kim 241 NH1 ARG A 12 77.001 16.948 29.867 1.00100.00 H 

AlflM 242 NH2 ARG A 12 77.526 17.508 32. 056 1.00100.00 H 

AXCM 243 W ELY A 13 70,07? 13,420 29,900 1,00 19,25 U 

10 AI2M 24J CA GLY A 13 68.802 13.904 29.258 1.00 16.50 £ 

M25 245 £ GLY A 13 67.849 14.144 30.428 1.00 18.88 £ 

AXQM 2.4$ 0 GLY A 13 68.202 13.902 31.624 1.00 14.04 Q 

ATOM 247 H MET A 14 66.653 14.632 30.103 1.00 1,^.00 ft 

AX2M 248 CA MET A 14 65.688 14.981 31.128 1.00 13.49 C 

15 AXflM 249 C MET ft 14 65.293 13.760 31.901 1.00.14.02 £ 

AlflM £1Q O MET A 14 65.408 13.713 33.145 1.00 17.06 Q 

AICM 251 CB MET A 14 64.442 15.605 30.^24 1.00 11.57 _£ 

AXCM 252 Cg MET A 14 , 63.320 15.628 31.559 1.00 20.77 £ 

AXSM_ Z21 SD MET A 14 61.926 16.766 31.110 1.00 ?9.1$ g. 

20 ATOM IM CE MET A 14 62.527 17.108 29.574 1.00 30.68 £ 

AXflM 255. H VAL A 15 64.798 12.769 31.158 1.00 25.23 £ 

AlflM 256 Cft VftL A 64.439 11.468 31.738 1.00 20.90 £ 

Aiai 257 C VAL A 15 65.654 10.713 32.378 1.00 17.26 £ 

AXQM Z53 O VAL A 15 65.590 10.239 33.524 1.00 18.41 £ 

25 ATOM 25? CB VAL A 15 63.752 10.550 30.680 1.00 23.25 £ 

ATOM 260 CGI VAL A 15 63.330 9.253 31.310 1.00 ^5.71 £ 

ATOM 261 CG2 VAL A 15 62.528 11.193 30.183 1.00 13.40 C 

A2EM 262 N GLY A 16 66.784 10.642 31.665 1.00 20.39 ft 

ATPM 213 CA GLY A 16 67.941 9.904 32.186 1.00 19.54 £ 

30 A1£M 264 g GLY A 16 68.522 10.432 33.492 1.00 29.29 £ 

Al£Sa 265 O GLY A 16 68.896 9.659 34. 434 1.00 16.91 £ 

Alfitt Zte U SERA 17 68.642 11.755 33.499 1.00 12.53 H 

AIQM 267 CA SER A 37 39,154 12,460 34,650 1.00 21.93, £ 

AISH 218 C SERA 17 68.209 12.214 35.818 1.00 13.35 £ 

35 AI2M 269 g SERA 17 68.677 11.957 36.915 1.00 24.19 £ 

AIQM 210 CB SERA 17 69.378 13.942 34.333 1.00 15.52 £ 

AX2M 271 OG SERA 17 68.153 14.619 34.372 1.00 ?? . 95 £ 

ATOM 272 N ALA A 18 66.896 12.143 35.590 1.00 17.52 H 

AXQ8 273 CA ALA A lfi 65.99 1 11 .629 36,729 1- 90 13,14 £ 

40 ATPM 274 C ALA A IB 66.220 10.393 37.307 1.00 19.29 £ 

AlfiM ?75 O ALA A 18 66.149 10.150 38.522 1.00 16.94 O 

ATOM 276 CB ALA A 18 64,460 12 T Q46 36. 334 1.00 lj. 33 £ 

AXOl 22'' . M ILEA 19 66.484 9.432 36.4 30 1.00 20.80 ft 

AlSaJ 278 CA ILEA 19 66.705 8.078 36.900 1.00 Ifl. Oft £ 

45 AXSM 279 .. C ILEA 19 67.975 8.090 37.7 30 1.00 jg. 09 £ 



SDOCID:<WO 9964618A1 I > 



WO 99/64618 



PCT/Usy9/ii57b 



88 



ATOM 280 O ILEA 19 68.016 7.530 38.820 1.00 20.73 Q 

ATOM 281 CB ILE A 19 66.B04 7.079 35.710 1.00 17.56 £ 

ATOM 282 CGI ILE A 19 65.444 6.812 35.162 1.00 10.09 £ 

ATOM 283 CG2 ILE A 19 67.309 $.555 36,1?? 1,QQ 21.50 £ 

5 ATOM 284 CD1 ILE A 19 65.528 6.361 33.741 1.00 19.05 £ 

ATOM 285 K ARG A 20 68.964 B.771 37.19B 1.00 18.13 M 

ATOM 286 CA ARG A 20 70.286 8.897 37.836 1.00 20.25 £ 

ATOM 287 C ARG A 20 70.231 9.491 39.242 1.00 30.62 £ 

aiSM 299 P ARE A 20 70,957 9.091 40.129 1.00 33.00 q 

10 ATOM 289 CB ARG A 20 71.201 9.743 36.957 1.00 11.71 ; £ 

A TOM 290 CG ARG A 20 72.610 9.781 37.449 1.00 23.79 C 

ATOM 291 CD ARG A 20 72.881 11.107 38.060 1.00 36.76 £ 

ATOM 292 NE ARG A 20 74.297 11.443 38. 062 1.00 48.34 H 

ATOM. 293 CZ ARG A 20 74.990 11.8 41 36.986 1.00100.00 £ 

15 ATOM 294 NH1 ARG A 20 74.393 11. 931 35.808 1.00100.00 U 

AXOH 295 NH2 ARG A 20 76.289 12.139 37.076 1.00100.00 H 

AXSM 295 N ARC A 21 69.368 10.461 39.439 1.00 22.10 H 

AlflH 297 CA ARG A 21 69,216 11.052 40. 750 1.00 17.45 £ 

ATOM 298 C ARG A 21 68.721 10.007 41. 730 1.00 26.71 £ 

20 ATOM , 299 O ARG A 21 69.147 10.001 42.885 1.00 30.27 Q 

ATOM 300 CB ARG A 21 68.142 12.144 40.708 1.00 17.93 _£ 

ATOM 301 CG ARG A 21 68.682 13. 522 40.321 1.00 27.57 £ 

ATOM 302 CD ARG A 21 67.586 1 4.599 40.130 1.00 23.02 £ 

AI2* 303 WE ARG A 21 67.619 15.000 38.743 1.00 55.12 H 

25ATOM 304 C2 ARg A 21 $6.538 15.103 37.995 1.00 10.55 £ 

WWM i B5l* WII 3 l o9^^ ■ Wll|l — — 1 1 1 1 

ATOM 30 6 NH2 ARG A 21 66.665 15.435 36.715 1.00 61.45 H 

AlfiM 307 N gift A, 22 67,713 9.223 41.345 1.00 27.48 H 

ATOM m QK, GfoN A 22, 57,157 8-257 42.313 1.09 24.79 £ 

30 AT9 M 309 C SLN A 22 59 .137 7,127 42,547 1,00 31.37 £ 

ATOM 310 P ELN A 22 6PV394 5,724 43.68$ l.QQ 27.47 ; Q 

A12M 311 CP SEN A 22 55.919 7.706 41.894 1.00 17.11 £ 

ATOM 312 CG A 22 64,921 8 f 7j$ 41.243 1,00 55.14 £ 

ATOM 313 CD GLK A 22 63.425 8.456 41.397 1.00 41.27 _£ 

35 ATOM 314 OE1 GLN A 22 53-002 7.329 41.762 1.00 29.34 Q 

ATOM 315 NE2 GLK A 22 52.510 9.464 41.046 1.00 20.12 _Jl 

ATOM 316 N LEU A 23 68.597 6.552 41.448 1,00 27.99 H 

ATOM 317 CA LEU A 23 69,549 St?7$ U.500 1.00 24,49 . £ 

ATOM 318 C LEU A 23 70.828 5.971 42.334 1.00 28.87 £ 

40 ATOM 319 O LEU A 23 11,298 5.219 43.165 1.00 30.79 Q 

ATOM 320 CB LEU A 23 70.036 $.107. 40.089 1.00 22.72 £ 

ATOM 321 CG LEU A 23 68.966 4.072 39.658 1.00 26.16 £ 

ATOM 322 CD1 LEU A 23 69.271 3.083 38.481 1.00 24.80 C 

ATOM 323 CD2 LEU A 23 68 t 42,7 3.294 40.93? 1 T 00 22.91 £ 

45 atom 324 n glu a 24 71.279 .7.192 42.153 1.00.29.77 u 
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&TPK 325 ca glv a 24 12^412 T- 675 42.gn9 i.qq 33. 79 £ 

AI0H 326 C gLV A 24 72.363 7.388 44.41? l.QQ 35.94 £ 

AX2M 327 0 ?LV A 24 73.381 7,140 45.031 1.00 39.07 _Q 

AI2M 328 CB GLU A 24 72.647 9.165 42.6 53 1.00 36.21 £ 

5 ATOM 329 CG GLU A 24 74.068 9.482 42.24 3 1.00 42.54 £ 

AXflM 330 CD GLU A 24 74.15B 10.689 41.333 1.00 89.51 £ 

AX2M 331 OE1 GLU A 24 73.386 11.663 41. $49 1.00 43. 21 Q 

AX0H 122 OE2 GLU A 24 74.994 10.646 40.398 1.00 66.28 Q 

hlSM 333 N gLN ft 2? 71.182 7.422 45.000 1.00 45.70 H 

10 Aim 334 CA GLWA 25 71.039 7.152 46.432 1.00 47.57 £ 

&Xai 33? C GLN A 25 70.887 5.669 46.740 1.00 67.34 £ 

AXCM 336 P GLN A..25 70.285 5.286 47.726 1.00 74.06 Q 

AICM 337 CB G LN A 25 69. 78$ 7.842 46.905 1.00 51.85 £ 

AXSM 339 QS GLU A 25 69.5QQ 9.084 46.109 1.00 44.91 £ 

15 Um 339 CD GLN A 25 68.419 9.913 46 .742 1.00100.00 £ 

ATOM 340 OE1 GLN A 25 68.271 9.947 47.972 1.00100.00 Q 

ATPM 341 NE2 GLN A 25 67.624 10.602 45.911 1.00100.00 ft 

ATOM 342 N ARG A 26 71.322 4.831 45.825 1.00 75.37 H 

ATOM- 343 CA ARG A 26 71.182 3.407 46.026 1.00 74. B7 £ 

20 Axon 344 C ARE A 26 72,568 2,791 46,147 l.PP 74rP8 _£ 

ATOM 34 5 O ARG A 26 73.440 2.997 45.289 1.00 77.00 Q 

AIPU 346 CB ARG A 2g 70.390 2.790 44.885 1.00 52.44 £ 

AX0U 347 CG A RG A 26 68.916 2.927 45.070 1.00 43.51 C 

BXPJ5 34? CP AR5 A 26 S3, 429. .1,752 45.864 l.PP 4P,7P C 

25 ATPM 349 NE ARG A 26 67.200 1.176 45.338 1.00 42.33 {J 

ATOM 35 0 CZ ARG A 26 67.126 0.508 44.196 1.00 32.07 £ 

AXQH 351 NH1 ARG A 26 63 .2 15 Q ,324 43.496 1,00 44,02 K 

ATOM 35 2 NH2 ARG A 26 65.968 0.017 43.771 1.00 77.32 H 

AIP25 353 y gliY A 27 72.779 2,114 47.266 IrPP 46. ?Q li 

30 AIOH 354 CA GLY A 27 74.060 1.531 47.549 1.00 46.82 £ 

ATOM 355 C GLY A 27 74.140 0 .165 46.923 1.00 55.45 £ 

ATOM 356 O GLY A 27 75.204 -0. 453 46.877 1.00 64.43 Q 

ATOM 357 N ASP A 28 73.017 -0.315 46.428 1.00 40.98 N 

ATOM 35 8 CA ASP A 28 73.016 -1.647 45.861 1.00 40.35 £ 

35 A12M 359 C AS P A 28 73.266 -1.536 44.400 1.00 39.55 £ 

ATOM 36 0 0 ASP A 28 73.109 -2.518 43.654 1.00 48 .80 O 

ATOM 36 1 CB ASP A 28 71.680 -2.335 46.127 1.00 47.80 £ 

ATOM 362 CG ASP A 28 70.503 -1.373 46.064 1.00 35.34 £ 

ATOM 36 3 QD1 ASP A 28 70.705 -0.1 40 46.095 1.00 39.23 Q 

40 Alps 364 PP2 ASP A 28 69,383 ~l t g7P 15,672 1*00 69,86 Q 

ATOM 36 5 N VAL A 29 73.651 -0. 329 43.996 1.00 31.03 

ATOM 366 CA VAL A 29 73.881 -0.050 42.591 1.00 28.44 £ 

AI2f 367 C VAL A 29 75.166 0.676 42.281 1.00 28.00 £ 

AISW 368 O VAL A 29 75.505 1.699 42.892 1.00 34.83 Q 

45 ATOM 369 CB VAL A 29 72.696 0.760 42.000 1.00 30.68 £ 
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ATOM. 370 CGI VM A 79 72.935 1.088 40.549 1.00 23.65 £ 

AlflM 371 CG2 VAL A 29 71.416 -0.028 42.156 1.00 27.95 £ 

AISM 372 N GLU A 30 75.824 0.219 41.230 1.00 30.76 H 

AIOJ 373 CA GL U A 30 76.995 0.924 40.736 1.00 28.38 £ 

5 AIM 374 C GLU A 30 76.678 1.471 39.332 1.00 31.03 £ 

AI£M 375 O GLU A 30 76.368 0.720 38.397 1.00 26.64 O 

AXflM 376 CB GLU A 30 78.199 0.006 40.72 ? 1.00 31.84 £ 

AI2M 377 CG GLU A 30 79.355 Q f 539 11.533 l.QQ 99.2$ £ 

MO! 22S CD GLU A 30 80.667 0.264 40.858 1.00100.00 £ 

. 10 Alps 37? PE1 SLV A 30 81,082 -Q.922 40.972 1.00 89. 94 2 

MO! 380 OE2 GLU A 30 81.202 1.206 40.219 1.00100.00 £ 

hXSii 391 W LEU A 31 76.665 2.789 39.20 7 1.00 22.24 H 

AICM 392 CA X/EV ft 31 76,2$? 3,393, 37.945 l.QQ 29,?7 £ 

AI21 393 C LEU A 31 77.404 3.507 36.941 1.00 25.79 £ 

15 ATOM 384 O LEU A 31 78.485 3.969 37.256 1.00 29.41 Q 

AlOl 38$ cb LEV A 31 75. €32 4.7$Q 39.191 1.00 30.20 £ 

AlflM 386 CG LEU A 31 74.329 4.763 38.994 1.00 29.37 £ 

AXPtf 397 CP1 LEV A 31 73.941 $.143 39.24.Q 1.00 23.43 £ 

AXOf 399 CP2 LEV A 31 12*225 3.9$2 39.291 1,90 23.Q4 £ 

20 atjqm 389 N VAL A 32 77.14$ 3.100 35.711 1.00 21.94 U 

AISM 390 CA VAL A 32 78.143 3.265 34.685 1.00 25.48 £ 

ATOM 391 , JS VAL A 32 77.535 4.242 33.669 1.00 38.76 £ 

ATOM 392 P VAL A 32 76.429 3.999 3 3.180 1.00 29.70 Q 

AXGM. 393 £B VAL A 32 78.517 1.902 34.055 1.00 34.25 £ 

■° g — 1 — TT — ^MW^^WWB ifiTlMiiTmiTpMinn-nMi apMMMg 

Alffll 395 CG2 VAL A 32 79.003 0.950 35.139 1.00 25.27 £ 

A1CM 39$ K LEV A 33 79.219 5.375 33.457 1.00 30.19 K 

AXCM 397 CA LEU A 33 77.732 6.463 32.621 1.00 22.71 £ 

AXQM. 398 p LEV A 33 78.727 6.979 31.645 1.00 29.55 £ 

30 AXfiH 399 O LEV A 33 79.996 7.152 31.988 1.00 30.09 Q 

Aiaa 400 CB LEU A 33 77.423 7.635 33.514 1.00 19.75 £ 

AISM 401 CG LEU A 33 76.729 7.200 34.779 1.00 19.38 £ 

AX2M 402 CB1 LEV A 33 7$. 914 8 t 344 35.7$2 1.00 27.24 £ 

ATOM 403 CD2 LEU A 33 7 $ ,211 6.913 34.444 1.00 22.07 £ 

35 ATOM 404 V ARC A 34 79,23? 7.421 30.496 1.00 15.09 h 

ATJ2M 405 CA ARG A 34 79.154 9.009 29.541 1.00 2$. 04 £ 

ATOM 406 C ARG A 34 78.46 9 9.173 28.916 1.00 36.57 £ 

AISM 407 P ARg a 34 77,299 9.130 29. $51 1-00 38.59 £ 

AIOH 408 CB ARG A 34 79.486 7. 048 28.398 1.00 22.89 £ 

40 ATOM 409 CG ARG A 34 90,579 6.081 28.70 6 1.00 23.29 £ 

ATOM 410 CD ARG A 34 91t370 $r$7$ 29.8$0 1,QQ 53,0$ £ 

ATOM 411 NE ARG A 34 91.793 5.4 59 30 . 7 11 l,op 90.25 N 

atom 412 cz arg A 34 82. $4$ 4.53P 3Q.323 l.PP 41.94 £ 

ATOM 413 WH1 ARG A 34 83.173 4.596 29.104 1.00 53.02 H 

45 ATOM 414 NH2 ARE A 34 92.993 3-547 31.148 l f QQ 25.5$ H 
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ATCM 415 V Tffi\ h 35 19.248 10,15$ 29,539 l t QQ 31,59 H 

atom 416 ca thr A 35 79.703 11,292 27, 933 1,Q,P ,29,33 £ 

atom 417 c thra 35 79,719 10,951 29,340 1,00 32.$? £ 

ATOM 418 O THR A 35 79.350 9.944 25.962 1.00 28.08 a 

5 ATOM 419 CB THR A 35 7 9.527 12.527 28.145 1.00 37.49 £ 

ATOM 420 OG1 THR A 35 80.844 12.429 27.560 1.00 31.91 £ 

ATOM 421 CG2 THR A 35 79.627 12.642 29.651 1.00 19.38 £ 

ATOM 422 M ARG A 36 78.032 11.780 25.529 1.00 30.02 M 

ATOM 423 CA ARE A 3$ 79.002 24.056 1.90 29,37 £ 

10 ATOM 424 C ARG A 36 79.406 11.765 23.503 1.00 31.46 £ 

ATOM 425 . Q AFC A 39 79.772 11.012 22.59 1 . 1,00 , £ 

ATOM 426 CP ARC A 3$ 77,054 12,^50 23,354 1.00 37,34 £ 

ATOM 427 CG ARG A 36 76.937 12.465 21.846-99.00 49.47 £ 

ATOM 428 CD ARG A 36 76.020 13.515 ' 21.232-99.00 63.09 £ 

15 ATOM 429 NE ARG A 36 75.528 13.124 19.915-99.00 75.23 H 

AISM 430.CZ Afig A 36 7 4,381 13 ,549 19.391- 99 .00 91,44 £ 

ATOM 431 NH1 ARG A 36 73.605 14.375 20.079-99.00 79.32 H 

ATOM 432 NH2 ARG A 36 74.009 13.144 18.185-99.00 78.73 H 

ATOM 433 N ASP A 37 80.217 12.677 24.063 1.00 $1.30 

20 ATOM 434 CA ASP A 37 81.606 12.710 23.601 1.00 44.91 £ 

AXCM 435 c ASP A 37 92,410 U^ill 24,043 1-00 24,99 £ 

ATOM 436 o ASP ft 37 $3,211 10,979 23,231 1,QQ 42.22 Q 

ATOM 437 CB ASP A 37 82.347 14.048 23.718-99.00 47.07 £ 

ATOM 438 CG ASP A 37 81.881 14.887 24.876-99.00 $2.99 £ 

25 ATOM 439 OD1 ASP A 37 80.679 14.839 25. 204-99. QO 64.45 Q 

AXCM 440 OD2 ASP A 37 82.711 15.638 25. 429-99.00 69.84 Q 

ATCM 441 N..CLVA 39 62,129 10,950 25.235 l.QQ 19.39 H 

AlSd 442 CA CfcV A 39 82.79Q 9,717 25,592 1-OQ 27.84 £ 

ATOM 4 43 C GLU A 3? 82.203 8.527 24.901 1.00 37.14 £ 

30 ATOM 444 O GLU A 38 82.873 7.511 24.699 1.00 35.04 Q 

ATOM 445 CB GLU A 38 82.691 9.435 27.207 1.00 25.18 £ 

ATOM 446 CG GLU A 38 83.116 10.549 28.183 1.00 37.45 £ 

ATOM 447 CD GLP A 38 82.807 10.212 29.655 1.00 21.13 £ 

ATOM 448 OE1 GLU A 38 81.623 9.997 30.014 1.00 55.97 Q 

35 ATOM 449 OE2 GLU A 38 83.757 9.979 30,419 liOO .99,79 Q 

ATCM 450 N LEU A 39 80.948 8.610 24.478 1.00 25.52 U 

ATCM 451 CA LEV A 3? . 8<M40 7,493 23,739 1,00 19,17 £ 

ATOM 452 C LEU A 39 79.291 7.764 22.825 1.00 20.34 £ 

ATOM 453 O LEU A 39 78.152 7.810 23.259 1.00 26.35 Q 

40 ATOM 454 CB LEU A 39 80.123 6.313 24.657 1.00 14.56 £ 

ATOM 455 CG LEU A 39 79.410 5.075 24.058 l-QQ 19,52 £ 

ATOM 456 CD1 LEU A 39 80.205 4.392 22.994 1.00 16,84 £ 

ATCM 457 CD2 LEU A 39 78.890 4.0 51 25.084 1.00 17_.41 £ 

ATOM 458 N ASN A 40 79.598 7.8 80 21.543 1.00 16,73 

45 AXi22 459 CA ASN A 40 78.548 7.971 20.540 1.00 21.55 £ 
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45 



AXOL 



504 CZ ARG A 



77.798 6.649 20.308 1.00 24.53 



78.328 5.720 19.688 1.00 10.96 



79.130 8.367 19.216 1.00 18.45 



78.054 8.727 18.22 5 1.00 42.19 



78.327 9.093 17.080 1,00 38.89 



76.827 8.730 18.697 1.00 23.71 



76.543 6.622 20.754 1.00 21, r 0B 



75 i $19 5.465 20.650 1.00 15.03 



75,225 5.068 19.213 1.00 18.22 



71,681 3.971 18.980 1.00 15.7? 



71-12$ 5.705 21.532 1.00 15.85 



74.822 6.029 22.974 1.00 21.90 



73.601 6.413 23.749 1.00 20.59 

7?t1Bl 4.796 23.609 1.00 17.97 



75,5 42 5.916 18.238 1.00 12.45 



75,256 5.607 16.831 1.00 15.99 

76,290 4.680 16.280 1.00 26.18 

76:066 4.039 15.2 57 1.00 22.41 

75.2B2 6.873 15.984 1.00 17.85 



74.180 7.854 16.39 9 1.00 30.70 
74.318 9.184 15.704 1.00 24.31 



72.764 7.241 16.208 1.00 31.13 



77.462 4.705 16.911 1.00 26.87 



81.083 3.758 16.?80 1.00 30.68 



60.971 2.551 16.082 1.00 32.36 



82.187 4.308 16.499 1.00 37.83 



79,139 1,511 16,377 1,VP 16.99 



77.978 0.173 16.789 1.00 17.67 



79,237 -9,163 17.392 1.00 20. 40 



79.206 -1.126 18.444 1.00 26.27 



77.504 -0.617 15.581 1.00 13.85 
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80.335 -0.301 16.682 1.00 15.63 
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91,91P -Q.225 19.521 1,PQ 29,19 



82.244 -0.937 19.457 1.00 27.65 



82.684 -0.261 16.203 1.00 27.46 



83.463 -1.338 15.495 1.00 92.03 



84.854 -1.418 16.077 1.00100.00 



85.636 -2.533 15.527 1.00100.00 



86.092 -3.570 16.236 1.00100.00 
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ATOM 505 NH1 ARG A 45 85.791 -3.6.95 17.547 l.OOlflO.OO ft 

ATOM 506 NH2 ARG A 45 flfi.,773 H.5H 15.642 1.00100.00 

ATCM 507 N MA A 46 01.772 1.999 38.629 .1.00,31,04 H 

ATCM 508 CA AIA A 46 82.045 1.743 19.8 81 1.00 24. 72 £ 

5 ATCM 509 C AIA A 46 ftJLulll 1.176 20.899 1.00 ^7 .73 £ 

ATCM 51Q 0 AIA A ii 8.1.512 0.625 22.027 1.00 22.73 £ 

ATOM 511 CB AIA A, 43 .. 81.839 3.221 19.751 1.00 27.16 £ 

ATCM 512 N VAX A 47 79,835 1,119 20.531 1.00 17.54 H 

ATOM 513 CA VAL A 47 ZixlIS 0.608 21.5 0B 1.00 21.41 £ 

10 ATOM 514 C VAL A 47 79.262 -0.812 21.914 1.00 30.25 £ 

ATCM 515 O VAL A 47 79.192 -1.202 23.097 1.00 15.85 Q 

ATCM 516 CP VAX A 47 77.470 0.668 20.989 I.QO 1.8.59 £ 

AI2M 517 CGI VAX A 47 76.503 0.042 22.012 1.00 16.88 £ 

ATCM 518 c<52 VAL h 47 77,115 2.Q96 2Q ¥ 756 1.09 16.28 £ 

15 ATCM 51 9 tf HIS A 48 79.692 -1.585 20.920 1.00 21.00 H 

AXflM 520 CA HIS A 48 80.028 -2,969 21.192 1.00 20.17 £ 

AlflM 521 C HIS A 48 81.268 -3.079 22.117 1.00 32.98 £ 

ATCM 522 O HIS A 48 81.289 -3.850 23.102 1.00 28.20 Q 

ATOM 523 CB HIS A 48 80.063 -3.801 19.855 1.00 14.93 £ 

20 ATCM 524 CG HIS A 48 78.686 -4.172 19.338 1.00 26.67 £ 

ATCM 525 ND1 HIS A 48 78.085 -5.394 19.600 1.00 28.83 N 

ATCM 526 CP2 HIS A 48 77.758 -3.448 18.659 1.00 25.56 £ 

ATOM 527 CE1 HIS A 48 76.887 -5.430 19.043 1.00 20.08 £ 

ATOM 528 KE2 HIS A 48 76.660 -4.260 18.475 1.00 25.22 H 

25 AXOf 529 N ASP A 49 82.217 -2.170 21.902 1.00 22.62 H 

ATCM 530 CA ASP A 49 83.455 -2.169 22.674 1.00 24.23 £ 

ATCM 531 C ASP A 49 83.171 -1.899 24.122 1.00 38.72 ; £ 

ATCM 532 O ASP A 49 B3.708 -2.5 51 25.027 1.00 35.44 £ 

ATOM 533 CB ASP A 49 84.396 -1. 112 22.127 1.00 30.29 £ 

30 ATCM 534 CG ASP A 49 84.991 -1.503 20.775 1.00 52.45 £ 

ATOM 535 OD1 ASP A 49 85.007 -2.726 20.449 1.00 42.67 Q 

ATCM 536 OD2 ASP A 49 85.416 -0.587 20.029 1.00 73.76 £ 

ATCM 53 7 N PHE A 50 82.294 -0.929 24.324 1.00 32.19 K 

ATOM 538 CA PHE A 50 81.902 -0.550 25.649 1.00 29.76 £ 

35 ATCM 539 C PHE A 50 81.299 -1.765 26.359 1.00 30.31 £ 

ATCM 540 O PHE A 50 81.715 -2.124 27.449 1.00 29.22 , Q 

ATCM 541 CB PHE A 50 80.892 0.610 25.576 1.00 23.82 ; £ 

ATOM 542 CG PHE A 50 80.137 0.843 26.859 1.00 19.13 £ 

ATCM 543 CP1 PHE A 50 80.740 1.515 27.931 1 - 00 20.11 , £ 

40 ATOM 544 CD2 PHE A 50 78.835 0.360 27.018 l.QQ 13.99 £ 

ATOM 545 CE1 PHE A 50 80.034 1,742 29.129 1-00 25.81 £ 

ATOM 546 CE2 PHE A 50 78.114 0,55? 28.212 liOO 22.81 £ 

ATOM 547 CZ PHE A 50 78.698 1.276 29.259 1.00 23.40 £ 

ATCM 548 N PHE A 51 80.280 -2.367 25.768 1.00 21.75 U 

45 ATCM 549 CA PHE A 51 79.655 -3.451 26.457 1.00 22.6] £ 
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AICM 550 C PHE A SI 80.64fi -4.603 26.612 1.00 34.01 £ 

ATCM 551 O PHE A 51 80.550 -5.401 27.590 1.00 25.28 £ 

AXCM 552 CB PHE A 51 78.369 -3.898 25.751 1.00 22.63 £ 

AXOl 553 CG PHE A 51 77.158 -3.140 26.170 1.00 27.58 £ 

5 ATCy 554 CD1 PHE A 51 76.426 -3.525 27.280 1.00 21. 7B £ 

him 555 CP2 PHE A 51 78.883 -2.199 25.389 1.Q9 19,55 £ 

AX£M 556 CE1 PHE A 51 75.287 -2.798 27.882 1,99 28.34 £ 

AXGM 557 CE2 PHE A 51 75.492 -1.403 25.734 1,00 14.47 £ 

ATOM 558 CZ P HE A 51 74.797 -1.744 26.878 1.00 14.55 £ 

10 him 559 N ALA A 52 81.57$ -4.798 25.859 1,99 28.43 U 

ATOM 560 CA ALA A 52 82.587 -5.793 25.714 1.00 29.44 £ 

AlflM 551 .£ ALA A 52 83. €97 -5.569 28.78A 1.99 43. 76 £ 

AX2M 562 O ALA A 52 84.502 -6.446 27.022 1.00 40.33 £ 

AXGM 563 CB ALA A 52 83.228 -6.049 24.344 1.00 24.25 £ 

15 AT.QM 564 JJ SERA 53 83.702 -4.382 27.385 1.00 31 .96 H 

AI2M 565 CA SERA 53 84.705 -4.090 28.377 1.00 21.06 £ 

ATOM 56$ £ SERA 53 84.196 -3.625 29 .709 1.00 26.41 C 

AICM 567 O SER A 53 84.985 -3.492 30.611 1.00 36.12 _Q 

him 568 CB SER A 53 85.709 -3.088 27.843 1.00 14.22 £ 

20 ATOM 569 OG S ERA 53 85.140 -1.807 27. 790 1.00 56.90 £ 

him 579 -M GLP A 54 82.892 -3.431 29.874 1.00 22.38 H 

ATOM 521 CA GLU A 54 82 .380 -2.893 31.139 1.00 17.27 £ 

AI2M 572 C ELV A 54 B1.5B4 -3.735 32.118 1.00 26.32 £ 

AlfiM 57? 9 ELV A 54 91.229 -3.291 33.191 1.09 37,43 £ 

AX24 575 TO SLV A 54 82.573 -9.543 39,282 q.OQ 44,77 £ 

AlfiM 578 CP SLV A 54 93.86? -9,142 31.194 1.99 98,33 £ 

Aiai 577 OEl GLU A 54 83.392 -0.232 32.428 1.00 50.11 Q 

AXQM 578 OE2 GLU A 54 B4.7B5 0.198 30.692 1.00 50.99 £ 

30 him 579 N ARG A 55 81.268 -4.971 31.804 1.00 29.63 H 

ATOM 5B0 CA ARG A 55 80.636 -5.748 32.854 1.00 33.32 £ 

A1£M 581 C ARg A 55 79,347 -5,149 33.378 1.99 38.45 £ 

AlfiM 582 ft ARg A 55. 79.214 -4.997 34.57.8 1.99 49.18 Q 

ATM 583 CB ARG A 55 81.621 -5.875 34.045 1.00 57.61 £ 

35 him 584 CG A RG A 55 82.666 .-7.028 33.960 1.00100.00 £ 

ATOM 585 CD ARG A 55 82.805 -7.805 35.305 1. 99109. 99 £ 

ATOM 586 NE ARG A 55 82.838 -9.270 35.146 1.00100.00 H 

him 587 CZ ARG A 55 83.206 -10.129 36.102 1.00100.00 £ 

ATOM 588 NH1 ARG A 55 83.583 -9.681 37.301 1,99109-99 E 

40 AX2M 589 NH2 A RG A 55 83.208 -11.440 35.855 1.00100.00 K 

ATOM .599 N ILEA 56 78.367 -5.029 32.491 1.00 42.25 

him 591 CA ILE A 56 77.064 -4.434 32.794 1.00 25.49 £ 

him 592 ... C ILE A 56 75.982 -5.474 33.244 1.00 20.18 £ 

him 593 O ILE A 56 75.897 -6.579 32.704 1.00 24.74 O 

45 him 594 CB ILE A 56 76.672 -3.512 31.531 1.00 26.89 £ 
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AX2M 640 rol LEU A 61 



67. $15 1.Q10 24,10? 
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ATOM 685 CB ILEA 69 55.251 20.432 16.490 1.00 ?Q. 11 £ 

AXflM 696 ggl ILB A 62 54.271 19.683 17.385 1.00 24.27 £ 

ATOM $97. ILE A 6? 54.610 20.749 15.1 81 1.00 47.53 £ 

ATCfl 699 CP1 ILE A 69 53.259 20.608 18.056 1.00 65.71 £ 

5 ATOM 689 N VAL A 70 58.010 21.327 15.269 1.00 23.03 H 

AlflB 6?Q CA VAL ft 70 59.. 797 31t9» 11.19? .l-PP.i9i34 £ 

ATCM 691 C VAL. A 7Q 59.983 21.011 13.840 1.00 24.42 £ 

AXGM 692 O VAL A 70 60.335 20.829 12.662 1.00 24.14 £ 

AXSM 693 CP VM A 70 59.304 23r404 11.467 l.PP 21,37 £ 

10 AlfiM 694 CGI VAL A 70 60.137 23.907 13.281 1.00 17.79 £ 

ATOM €95 CG2 VAL A 70 58.136 24.410 14.678 1.00 15.74 £ 

AIGM 696 N ALA A 71 6Q t frgl 2Q,45P H.pgi 1,PP 19t69 H 

AIQM 697 CA ALA A 71 61.782 19.617 14.572 1.00 16.57 £ 

ATOM 698 C ALA A 71 61.427 18.289 13.910 1.00 23.36 _£ 

15 A1SM 699 0 ALA A 71 61.980 17.923 12.849 1.00 21.84 Q 

AXSM 700 CB AIA A 71 62.685 19.439 15.805 1.00 9.36 £ 

AlflM 7Q1. H ASN A 72 60.463 17.598 14.511 1.00 16.80 H 

AT.QM 702 CA ASN A 72 59.998 16.357 13. 923 1.00 18.84 £ 

Aiai 703 C ASN A 72 59.608 16.539 12.440 1.00 23.87 £ 

20 ATSai 2PJ 2 ASN A 72 59-919 15-696 11.593 l.PP 21.52 Q 

Aiaa 705 CB ASN A 72 58.835 15.806 14.738 1.00 8.60 £ 

him 7Q6 eg A S N , A 72 59.3Q9 15,P 13 1 5 .911 l .PP 23. 75 £ 

AXCH 7P7- QD1 ASN A 72 59.559 13.9P9 15.91P l.PP 23.99 Q 

AlfiM 708 ND2 ASN A 72 59.572 15.701 16.996 1.00 9.96 H 

25 ATOM 7P9... M ASN A 73 58.931 17.647 12. 138 1.00 23.07 N 

AXPJi 7ip CA ASN A 73 59,521 17.971 IP. 761 l.QO 26.P5 Q 

ATOM 711 C ASN A 73 59.665 18.454 9.817 1.00 26.95 £ 

ATOM 712.. O ASN A 73 59.613 18.276 8.569 1.00 22.13 2 

ATOM 71 3 CB ASN A 73 57.383 19.001 10.600 1.00 14.66 £ 

30 ATOM 714 CG ASN A 73 5 6.015 18.349 10.987 1.00 19.88 , £ 

ATOM 715 OD1 ASN A 73 55.620 17.468 10.217 1.00 27.02 Q 

ATOM 716 NT) 2 ASN A 73 55.322 18. 732 12.051 1.00 20.78 U 

AIPM 71 7 , y THE . A 74 . 6P.71P 19-P29 IP. 419 l.PP 19.69 M 

ATOM 718 CA THR A 14 61.845 19.540 9,,657 l.PP 1Q.Q7 , £ 

35 ATOM 719 C THR A 74 62.968 18.548 9.375 ItPP 21. PP £ 

ATOM 720 O THR A 74 63.537 18.561 8.289 1.00 11.75 Q 

ATOM 721 CB THR A 74 62.411 20.746 10.306 1.00 29.10 £ 

ATOM 722 OG1 THR A 74 6 1.370 21.714 10.457 1.00 23.24 £ 

ATOM 723 CG2 THR A 74 63.541 21.299 9,452 l.PP 21.63 £ 

40 ATOM 724 N TYR A 75 63.230 17.636 10.310 1.00 17.10 H 

ATOM 725 CA TYR A 75 64.267 16.620 10.112 1,00 9.Q7 £ 

ATOM 726 C TYR A 75 63.733 15 .203 10.318 1.00 £U2 £ 

ATOM 727 O TYR A 75 64.143 14 .542 11.267 1,PP 15.59 P 

ATOM 728 CB TYR A 75 65.302 16. 825 11.188 1.00 11.89 £ 

45 ATOM 729 CG TYR A 75 65.779 18.234 11.252 l.PP 27.12 £ 
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ATOM ...-966 CP1 ILE A 91 ,66,32Q ~2v619 21.997 1.00 16,23 £ 

ATOM 867 N HIS A 92 71.446 -4.963 20.785 1.00 24.12 ft 

ATOM 969 CA HIg A 92 72,293 -6,915 20.213 1,09 26,71 £ 

5 ATOM 869 C HIS A 92 73.609 -6.251 21.071 1.00 29.30 £ 

ATOM 970 Q HIS A 92 73.983 -7,366 21.443 1.00 18.58 £ 

ATOM 871 CB HIS A 92 72.561 -5.682 18.775 1.00 22.23 £ 

AX2M 972 Cg HIS A 92 73.365 -6.720 19,077 1.00 26.32 £ 

ATPM 973 NPl HIS A 92 72,796 -7,711 17,307 1.00 27,19 H 

10 ATOM 87 4 CD2 HIS A 92 74.699 -6.978 18.106 1.00 21.95 £ 

AK3U 975 CE1 HIS A 92 73r755 "9,197 16.926 1,PQ 23*66 £ 

ATOM 876 NE2 HIS A 92 7 4.918 -8.062 17.296 1.00 17.36 H 

AXflM 977 W ALA A 93 74.328 -5.187 2 1.333 1.00 15.66 H 

AlflM 878 CA ALA A 93 75.530 -5. 301 22.110 1.00 11.88 £ 

15 ATQK 979 C ALA A 93 75.222 -5.900 23 .512 1.00 28.78 £ 

AXOl 990 P ALA A 93 75.912 -6.790 24 .037 1.00 25.23 £ 

AIGM 991 CP ALA A 93 76 .139 -3,959 22.221 1.90 6.30 £ 

AXCM 992 W ALA A 94 74.142 -5.442 24.113 1.00 18.82 fi 

ATOM 883 CA ALA A 94 73.777 -5.971 25.399 1.00 15.61 £ 

20 AIQM 8 84 C ALA A 94 73.593 -7.503 25.301 1.00 28.39 £ 

ATOM 885 O ALA A 94 74.133 -8.263 26.099 1.00 21.67 __Q 

ATOM 996 CP ALA A 91 72.419 -5, 279 .. 25 .913 1-QQ 19,46 £ 

AXQM 887 N HIS A 95 72.814 -7.966 24.329 1.00 26.35 H 

ATOM 888 CA HIS A 95 72.551 -9.396 24.271 1.00 24.89 £ 

25 ATOM 889 C HIS A 95 73.845 -10.176 24.140 1.00 22. Bl £ 

ATOM 890 O HIS A 95 74.077 -11.136 24.865 l.QQ 21.14 Q 

ATOM 891 CB HIS A 95 71.571 -9.778 23.129 1.00 22.39 £ 

ATOM 892 CG HIS A 95 7 1 .554 -11,250 22.831 1.00 28.73 £ 

ATOM 893 NPl HIS A 95 70.979 -12.1 82 23.682 1.00 22.83 H 

30 ATOM 894 CD2 HIS A 95 72 . 1 59 - 11 . 964 21.845 1.00 25.22 £ 

ATOM 895 CE1 HIS A 95 71.171 -13.397 23.196 1.00 22.72 £ 

AX2M 996 NE2 HIS A 95 71. 9U -13,296 22.101 liQQ 21.90 , N 

ATOM 89 7 N GLN A 96 74.709 -9.658 23.281 1.00 19.97 JJ 

ATOM 898 CA GLN A 96 75.960 -10.299 22.917 1,00 22,27 £ 

35 AT CM 899 C GLN A 96 76.877 -10 .353 24.086 1.00 26.58 £ 

ATOM 900 0 GLN A 96 77.836 -11.093 24.088 21.17 Q 

ATOM 901 CB GLN A 96 76.642 -9.492 21.818 } t PP 23.39 £ 

ATOM 902 CG GLN A 96 7 7.043 -10.299 20.596 1.00 61.06 £ 

ATOM 903 CD GLN A 96 78.033 -9.557 19.675 j.OO ,75 t fl3, £ 

40 ATOM 904 OE1 GLN A 96 78.999 -8.941 20.131 1,00 56.89 £ 

ATOM 905 NE2 GLN A 96 7 7.815 -9.668 18.366 1, 00100.00 H 

ATOM 906 N ASN A 97 76.652 - 9.500 25.060 1.00 22.15 £ 

ATOM 907 CA ASN A 97 77.537 -9.536 26.208 1.09 14,74 £ 

ATOM 908 C ASN A 97 76.732 -10 .022 27.387 l.QQ 2,9 f 79 £ 

45 ATOM 909 O ASN A 97 77.049 -9. 762 28.564 1.00 27.09 2 
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78.241 -8. 201 26.46 2 1.00 12.93 



AT9M 



911 CG ASN A 97 



79,2W -7.897 25.107 1,00 21,91 



ATPM 



912 OD1 ASN A 97 



80.331 -8.518 2S.37S 



ATOM 



913 ND2 ASN A 97 



78,839 -7.13? 21.392 



ATOM 



914 N ASP A 98 



75.666 -10. 732 27.055 



ATOM 



915 CA ASP A_98 



71 . 9Q7 -11 ,361 28.0&9 



AXCM. 



916 C ASP A 98 



71.100 -10,379 29. 161 



917 P ASP A 98 



74.505 -10.634 30.367 



10 



A3 
A3 



918 CB ASP A .98 



75.791 -12,150 28.7Q0 



919 CG ASP A 98 



75.016 -13.712 29.053 



ATOM 



920 OD1 ASP A 98 



73.775 -13.749 28.877 



ATOM 



921 QD2 ASP A 98 



75.65$ -11,670 29.512 



ATOM 



922 N VAL A 99 



73.879 -9.235 28.730 



ATM. 



923 CA VAL A 99 



73.157 -8.351 29,635 



15 



ATOM 



924 C VAL A 99 



71.706 -8.868 29.530 



ATOM. 



VAL A 99 



71.159 -9.088 28.422 



ATOM 



926 CP VAL A 99 



73,261 -6,900 29,206 



ATOM. 
ATOM 



927 cgl VAL A 99 



72.517 -6.015 30,19? 



928 052 VAL A, 99 



74.720 -6.515 29.225 



20 ATOM 929 W ASN A 100 



71.119 -9.262 30,662 



ATOM 



930 CA ASN A 100 



69.852 -9 .925 30.613 



ATOM. 



ASN A 100 



68.648 -9.034 30.910 



A3 



932 O ASN A 100 



67.198 -9.377 30,582 



ATOM 



933 CB ASN A 100 



69.846 -11.157 31.527 



ATOM 



eg ASN A 100 



68.721 -12.112 31.180 



.00 20.38 




ATOM 



936 ND2 ASN A 100 



67.716 -12.240 32.076 



ATOM 



937 N LYS a 101 



68,. 91 ] -7.923 31.584 



him. 



938 CA LYS A 101 



67.970 -6.916 31.994 



30 



ATOM. 



ITS A 101 



€8,107 -5.510 31,323 



ATQEL 
ATOM 



LYS A 101 



69.151 -1.850 31,377 



941 CB LYS A 101 



67.996 -6.807 33.521 



ATCti 



912 CG LYS A 101 



67.464 -8.054 34.205 



ATOM, 



94? cp LYS A ,101 



67.218 -7. 7^9 35.668 



35 



ATOM 



911 CE LYS A 101 



66.206 -6.569 35.865 



ATOM. 



915 HZ LYS A 101 



64.750 -7.006 35.825 



ATOM 



916 N LEU A 102 



67.013 -5. 043 30.732 



AXCM 



947 CA LEU A 102 



67.003 -3.744 30.Q92 



ATOM 



948 



LEU A 102 



65.612 -3. 115 30.156 



40 



ATOM 



949 



LEV A 102 



64.590 -3, 811 30.102 



ATOM 



950 eg LEV A 102 



67.465 -3.B98 28.636 



ATOM 



951 CG LEU A 102 



67.553 -2,711 27.651 



ATOM 



952 CD1 LEU A 102 



68,628 '2.985 26.559 



ATOM 



953 CD 2 LEU A 102 



66.162 -2.407 26.995 



45 



ATOM 



954 N LEU A 103 



65.595 -1,798 30.318 



«00 57.17 



,0 0 , . 3 1, 8.8. 



,00 27.98 



.00 29.25 



-00 37.53 



.00 36.12 



00 36.37 



00 88.62 



,00 82.53 



,00100.00 



oo 27.13 



rOO 21.57 



00 16.15 



•00 19-47 



.00 21.18 



00 11.58 



,00 30.10 



.00 17.39 



.00 25.77 



.00 21,95 



rOO 20,88 



.QQ 11.98 



1.00 16.35 



■00 17.91 



00 25.43 



.00 25.29 



,00 19. 



00 29.28 



iQQ 9.31 



1.00 38.93 



-Q0 13,38 



■QO 15.26 



00. 22,2 2 



•QQ 15.10 



.00 18.55 



■00 18.92 



,00 11,23 



.00 15.51 



,00 9.65 



00 13.10 



.00 17,05 



Ji 
JI 

S. 

S. 

J2 

-Q 

JI 

-£ 

S. 
J} 



Si 
SL 
@ 
JI 
Ji 
Si 
Si 
SI 

-£ 

Si 



Ji 



JI 
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AISM 955 CA LEV A 193 64.356 -1.036 30.265 l.QQ 1fi ,33 £ 

frTOM 956 C LEU A 103 64.346 -0.072 29.046 l.QOjg.flS £ 

AISM 252 Q LEtt A 193 $5,115 0.789 28.875 l.oo iq.^r £ 

ATOM 958 CB LEU A 103 64.099 -Q.299 31.562 1.00 12.28 £ 

5 AXOJ 9?? . C<? LEV A 193 ,S2. €9$ 0.259 31.594 1.00 14.1.1 £ 

ATOM 960 CD1 LEU A 103 61-645 -0.822 31.902 1.00 10.31 £ 

ATOM 961 CD2 LEU A 103 €2.646 1.360 32.601 1.00 12.30 £ 

ATOM 962 N EHE A 191 63.417 -0.333 28.140 1.00 16.41 £ 

AISM 9S3 CA PHE A 101 63.215 0.486 26.956 1.00 18.32 £ 

10 &TPM, 954 C EHE A 194 62.126 1.546 27.249 1.00 21.85 £ 

AXOl 955 O PHE A 104 61.168 1.271 27.992 1.00 18.36 £ 

ATOM 966 CB PHE A 104 62.796 -0.386 25.793 1.00 9.86 £ 

AlflM 967 Cg PHE A 194 62.732 0.348 24.508 1.00 16.81 £ 

AIQM 968 CD1 PHE A 104 63.894 0.714 23.840 1.00 25.04 £ 

15 AXSB 969 CP2 PHE A 104 61.511 0.795 24.005 1.00 22.59 £ 

A1SM 970 £S1 PHE A 194 , 63.836 1.448 22.619 1.00 31.26 £ 

A103 971 £E2 PHE A 194 61.449 1.535 22.814 1.00 15.59 £ 

AXQM 222 CZ PHE A 104 62. 625 1.895 22.139 1.00 11.67 £ 

&TO*J 973 H LEU A 105 62.341 2.762 26.734 1.0Q 20.33 fi 

20 AXOl 974 CA LEV A 195 61.416 3.897 26.904 1.00 18.10 £ 

AT9K 9?5 C LEU A 105 60.711 4.237 25.634 1.00 17.04 £ 

AT9H 9?S P LEU A 105 61.315 4.680 24.665 1.00 18.83 Q 

AXflM 977 CB LEU A 105 62.178 5.146 27.214 1.00 17.49 £ 

978 £S LEV A 195 62.434 5.544 28.644 1.00 27.17 £ 

25 AXQM 979 CD1 LEU A 105 62.630 4.349 29.574 1.00 19. 16 C 

ATOM 980 CD2 LEU A 105 63. 688 6.347 28.529 1.00 23.59 C 

ATOM 9P1 W GLY A 106 59.407 4.153 25.652 1.00 20.66 H 

AIQM 992 CA SLY A 19$ 58.679 4.536 24.455 1.00 21.03 _£ 

ATOM 983 C GLY A 106 58.080 5.935 24.597 1.00 17.32 £ 

30 994 Q GLY A 106 58.690 6.858 25.113 1.00 26.89 Q 

AXSM 99? M SEE A 197 56.831 6.047 24.219 1.00 22.05 H 

AI2M 986 £A §EP A 197 56.177 7.317 24.288 1.00 22.12 £ 

A1QM 987 C sep a 197 5^,gp6 7r212 23,923 1.99 19, P6 _£ 

AXCM 988 O SER A 107 54.314 6.545 22.963 1.00 27.42 O 

35 AI2M 999 CB $ER A 197 56,992 9j232 23,399 1.99 29.99 £ 

AISM 999 os SEP A 1Q7 5?. 947 9*. I?? 22,776 1,00 42.9? Q 

ATOM 991 N SER A 108 53.826 7.890 24.671 1.00 27.42 H 

AXSfcl 992 CA SER A 108 52.382 7.947 24.339 1.00 26.43 £ 

ATSM 993 ,C SER A 106 52.144 8.259 22. 842 1.00 30.97 C 

40 AlflM 994 O SER A 108 51.242 7.709 22.217 1.00 33.46 £ 

ATOM 995 CB SER A 108 5 1.710 9.07 2 25.144 1.00 19.87 £ 

AXSM 996 OG SER A 108 52.495 10.266 25.0 71 1.00 70.88 £ 

ATOM 997 N CYS A 109 52.927 9.180 22.278 1.00 24.73 H 

AlfiM 998 CA CYS A 109 52.728 9.549 20.88 0 1.00 25.61 £ 

45 aX£M 999 C CYS A 109 52.970 8.482 19.815 1.00 21.29 £ 
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ATOM 1000 Q CYS A 109 



52.967 8. 737 16.623 1.00 31.31 



Si 
A 

-£ 



ATOM 1001 CB CYS A 109 



53.369 10.899 20.544 



PP 39.55 



002 SG CYS A 109 



55.153 11.077 20.847 



00 49.24 



ATOM 



003 N ILE A 110 



53.101 7.264 20.258 



,PP 19,31 



ATOM 



QUA CA ILE a no 



53.329 6.150 19.379 



PP 28, IP 



ATOM 



HE a no 



51.977 5.489 19.08? 



.00 15.38 



him. 



ILE A llfl 



51.895 4.592 18.268 



PP 1$,52 



ATCM 



0P7 CP ILE A 110 



54,154 5,153 2Q,2Pg 



tOP 4Q,45 



008 CGI ILE A 110 



55,$P4 5-51P 20,13$ 



PP 39.02 



10 



ATOM 



Aim 



ATOM 



009 CG2 ILE A 110 



53.879 3.715 19.875 



T 0P $1.33 



filfl CD1 ILE A lip 



56.429 4.338 20.549 



■00 82.74 



fill M TYR A m 



50.951 5.H2 19,954 



iPP 11,91 



Aim 



Aim 



012 CA TYR A 111 

013 C TYR A 111 



49, HP 5,227 19,679 



.00 13.96 



48.956 5.831 18.459 



tPP 20. IP 



15 



ATOM 



Iflli 



TYR A 111 



49.302 6.933 18.056 



tPP Ut71 



ATOM 



015 CB TYR A 111 



48.763 5.468 20.921 



.00 9.63 



ATOM 



016 CG TYR A 111 



49.117 4.550 22.065 



PP 14,94 



Aim 



017 CD1 TYR A 111 



48.985 3.159 21.938 



■00 9-73 



ATOM 



£15 CD2 TYR A 111 



49,755 5,P39 23,21$ 



PP 14.9$ 



20 A3 



ATOM 



P19 CE1 TYR A 111 



49.344 2.273 23.014 



.00 6.53 



020 CE2 TYR A 111 



50.146 4.155 24.272 



00 13.66 



.£ 

S2. 

_c_ 

JI 
J? 
-£ 



Aim 



1P21 CZ TYR A 111 



49.873 2.787 24.171 



•PP 17,9$ 



Aim 



P22 QH TYR A 111 



50.266 1.927 25.157 



OP 11,37 



Aim 



023 W PRC A 112 



47,974 , 5-1 45 17 T 972 



tPP 22,56 



25 



Aim 



024 CA PRO A 112 



47.279 5.743 16.721 



00 23.44 



Aim 



02$ o PRQ A 112 



46.197 7.453 18.115 



tPP 1 9,7 2 



ATOM 



027 CB PRO A 112 



4$, 290 4,544 1$,25? 



,00 15,69 



ATOM 



1028 CG PRO A 112 



46.895 3.343 16.769 



.00 22,63 



30 



ATOM 



029 CP PRQ A 112 



47 f 593 3,733 19.096 



,00 16, 1Q 



Aim 



030 N LYS A 113 



46.418 7.866 15.915 



00 19-48 



AIPH 



ATOM 



031 CA LYS A 113 



45,79? 9,1S7 15,994 



.00 23.50 



Q2Z 



LYS A 113 



44.396 9.077 16.655 



00 34.29 



Aim 



033 Q LYS A 113 



44.046 9.887 17.524 



-00 46.14 



35 



ATOM 



ATOM 



034 CB LYS A 113 



45,575 9,735 14,593 



OP 30.04 



035 CG LYS A 113 



46.219 11.124 14.477 



>PQ 43.79 



ATOM 



03$ CP LYS A 113 



45.381 11.941 13.515 



PQ1 PP,PP 



Aim 



037 CE LYS A 113 



44.361 12.836 14.250 



.00100.00 



Aim 



03 B NZ LYS A 113 



43.480 13.625 13.304 



ooioo.oo 



40 



Aim 



Aim 



039 N LEV A 114 



43.591 8.103 16.250 



.00 26.33 



filfi C A , .LEV A . 114 



42.267 7. 957 16.833 



PP 2P,$5 



ATOM 



Aim 



041 C LEU a 114 



42.083 6.792 17.760 



.00 18,44 



QA1 



LEV A 114 



41.002 6.278 17.918 



OP 34.P4 



45 



ATCM 



Aim 



043 CB 



044 CG 



LEU A 114 
LEV A 114 



41.194 8.002 15.780 



.00 24.37 



41.587 9.122 14.830 



PP 40,9$ 



WO 99/64618 



PCT/US99/11576 



105 

M9H 1945 CP1 LEV h 114 . 13.504 i.oo ao.io £ 

ATOM 1046 CD2 LEU A 114 41.139 10.512 15.300 1.00 9B.BS £ 

ATQM 1 047 y ALA A 115 43.rlQ3 6.473 18.527 1.00 ?9.00 H 

ATOM 1949 CA MA A 11$ 42.929 5.446 19.526 1.00 25.66 £ 

5 ATOM 1049 C ALA A 115 41.722 5.727 20.454 1.00 28.76 £ 

ATOM 1050 O ALA A 115 41,364 6.855 20.682 1.00 24.12 Q 

AT CM 1051 CB ALA A 115 44.177 5.272 20.326 1.00 16.96 £ 

ATOM 1052 N LYS A 116 41.137 4.675 20.998 1.00 30.21 H 

him 19?? CA LYS A 11$ 4Q,Q3$ 4.792 21.928 1.00 25.85 £ 

10 him 1954 C LYS A 11$ 49, $59 5.248 23.195 1.00 14.18 £ 

ATm 1Q5& 0 LYS A 11$ 41,759 4.781 23.535 1.00 23.51 £ 

him IM cp LYS A 11$ 39.3ft? 3.415 22.116 1.00 22.05 £ 

him 1957 CS LYS A 11$ 39,953 3.032 23.5 24 1.00 55.38 £ 

ATOM 1059 CP LYS A 11$ £LJ&£3 1.955 23.549 1.Q01Q0.0Q £ 

15 AXflM 1959 CE LYS A 11$ 37,129 1.953 24.835 l.ooioo.oo £ 

him 19$0 NZ LYS A 11$ 35.76.7 1.310 24.630 l.ooioo.oo h 

him 10$1 N SLW A 117 49.921 6.208 23.856 1.00 lfl.23 ft 

ATPW.. CA GLN A 117 40.456 6.757 25.180 1.00 21.01 £ 

ATCM 1Q$3 C <?LH A 117 . 39. $95 6.178 26.383 1.00 30.96 _£ 

20 ATOM 1064 O GLN A 117 38.483 6.009 26. 345 1.00 27.66 Q 

ATEtt 1065 CB GLN A 117 40.215 8.263 25.179 1.00 11.32 __£ 

hXm 1066 CG GLN A 117 40.849 8.912 23.948 1.00 12.12 £ 

him 1067 CD GLN A 117 42.404 8.823 23.954 1.00 2^.10 £ 

him 1068 OE1 GLN A 117 43.041 8.628 22.696 1.00 47.88 £ 

25 ATOM 19$9 HE2 CLF* A 117 43.991 9-95? 25.131 1.99 14-24 N 

ATOM 1070 N PRO A 118 4 0.374 5.9 92 27.499 1.00 30.02 H 

him 1071 CA PRO A 118 41.826 6.194 27.655 1.00 26.44 £ 

him 1972 £ PRO. A 119 42.459 5,959 2$. 999 1.99 24.37 £ 

ATOM 1073 O PRO A 118 41.792 4 .027 26.72 6 1.00 25.34 Q 

30 him 1074 CP prp a 119 *2,955 5i994 29,1$7 1 T 99 23.99 £ 

him 1075 CG PRO A 118 40.847 5.240 29.654 1.00 23.20 £ 

him 197$ CP PRP A 119 39. $95 5-519 29.799 1,99 15.79 £ 

him 1077 N MET a 119 43. 684 5,229 2$,432 1-99 1$.99 H 

KEEM CA MET A 119 44.372 4.215 25.644 1.00 lp. 80 £ 

35 him 1979 C MET A 119 45.9$2 3.993 2$, 444 1.99 23.$1 £ 

him 1999 O MET A 119 46.013 3.281 27.209 1.00 18.02 P_ 

him 1991 CP MET a 119. 45,394 4-994 24,791 1.Q0 13,52 £ 

him 1082 CG MET A 119 44.801 6.014 23.989 1.00 18.52 £ 

him 1083 SD MET A 119 46.157 7,954 23,271 1,99 2$. 27 £ 

40 ATOM 1084 CE MET A 119 46.264 6.524 21.845 1.00 33.79 £ 

ATOM 1085 N ALA A 120 44.559 1.875 26.271 1.00 26.64 H 

him 1086 CA A LA A 120 45,177 0.712 26.884 1.00 29.17 £ 

him 1087 C ALA A 12p 46.356 0.308 25.984 1.00 23.21 £ 

him 1088 O A LA A 120 46.439 0.759 24.833 1.00 20.19 Q 

45 him 1089 CB ALA A 120 44.169 - 0.419 26.944 1.00 26.02 £ 
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A T.C M 1090 M czi.Ji ft 171 47.238 -0. 553 26.507 1.00 12.30 H 

AX2M 1091 CA GLU A 121 48.427 -1.009 25.78B 1.00 9.45 £ 

AXO* 1092 C, ffi,V A 171 49.070 -1,597 24,150 l.QQ 11. 99 £ 

ATOM 1093 O GLU A 121 48.828 -1.670 23.450 1.00 14.84 £ 

5 hXm 1094 CB gLV A 121 49.321 -1.003 29.715 1.00 19.74 2 

ATOM 1095 CG GLU A 121 50.132 -1.122 27.763 1.00 18.14 £ 

1096 CD GLU A 121 49.458 -1.000 29.137 1.00 13.00 £ 

AIQM 1097 OE1 GLU A 121 48.252 -1.294 29.276 1.00 20.79 Q 

ATOM 1098 OE2 GLU A 121 50.123 - 0.521 30.080 1.00 17.86 £ 

10 ATOM 1099 W SER A 122 46.887 -2.273 24.409 1.00 11.79 H 

AJJ3M Uflfl CA SER A 122 49.427 -2,977 23.219 1, 00-12. It £ 

AICM 1101 C SER A 122 46.030 -2.058 22.100 1.00 11.70 £ 

AISM 1102 9 SEE A 122 45.717 -2,529 21.010 1,00 13,91 Q 

ATOM 1103 CB SER A 122 4 5.186 -3.781 23.568 1.00 21.50 £ 

15 &IC£ 1104 OG SER A 122 44.143 -2.908 23.976 1.00 28.52 Q 

AlfiM HQS . N gLV A 123 49,041 -0,754 22,341 1,00 14-6$ M 

AX2H 110$ CA gLV A 123 45.783 0.202 21.243 1.00 17.15 £ 

ATJ2M 1107 C gLV A 123 49.95? 0, 313 20 .240 1,00 11.49 £ 

ATOM 1108 O GLU A 123 46.821 0.844 19.141 1.00 11,19 £ 

20 ATOM 1109 CP gLV A 123 45.481 1.600 21.805 1.00 21.66 £ 

Aiai 1110 CG GLU A 123 44.127 1.694 22.523 1.00 24.68 £ 

ATOM 1111 CD GLU A 123 42.984 1.374 21.585 1.00 35.56 £ 

AIQM 1112 OE1 GLU A 123 43.019 1.865 20.4 26 1.00 41.73 £ 

AXQM 1113 OE2 GLU A 123 42.158 0.497 21.940 1.00100.00 O 

ATOM 1115 CA LEU A 124 49.296 -0.082 19.740 1.00 15.32 _£ 

AIQM lug c LEV A 124 49,Q82 -0.754 18,459 1-00 17.7$ C 

ATOM 1117 . Q LEU A 124 48.752 -1.917 18.44S 1.00 18.91 _Q 

Am 1119 cp LEV A 124 50,554 -Q,990 20,392 1,00 19,07 , .£ 

30 ATOM 1119 CG LEU A 124 5 1.922 -0.222 19.803 1.00 21.52 £ 

ATOM 1120 CD1 LEU A 124 52.080 1.258 20.117 1.00 20.35 £ 

ATOM 1121 CP2 LEV A 124 53,042 -0.919 20.550 1.00 14.07 £ 

him 1122. H LEU A 125 49.514 -0.071 17.409 1.00 18.44 g 

ATOM 1123 CA LEU A 125 49.445 -0.564 16.052 1.00 19.92 £ 

35 ATOM 1124 C LEU A 125 48.034 -0.754 15.509 1.00 25.56 £ 

ATOM 1125 O LEU A 125 47.854 -1.188 14.364 1.00 18.26 Q 

ATOM 1126 CB LEU A 125 50.355 -1.800 15.840 1.00 20.79 £ 

ATOM 1127 CG LEU A 125 51.890 -1.511 15.778 1.00 17.21 £ 

ATOM 1128 CD1 LEU A 125 52.744 -2.649 16.316 1.00 19.95 £ 

40 ATOM 1122 CP2 LEV A 125 5?, 334 -1.219 14.339 1-00 5,91 £ 

ATOM 1130 N GLW A 126 47.027 -0.327 16.276 1.00 21.97 H 

ATOM H31 CA GLN A 126 45.652 -0.504 15.790 1.00 19.97 £ 

ATOM 1132 C GLN A 126 45.213 0.447 14.724 1.00 28.31 £ 

ATOM 1133 O GLN A 12 6 44.076 0.391 14.293 1.00 47.49 £ 

45 ATOM 1134 CB GLN A 126 44.652 -0.404 16.911 1.00 19.87 _£ 
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ATOM 1135 CG GLN A 126 44.949 -1.312 Tft.048 1.00 18. 3Q £ 

ATOM 1136 CD GLN A 126 44.319 -2.626 17.R35 l.OOfifi.flO £ 

ATOM 1137 OE1 GLN A 126 44.064 -3.376 18.792 1.00 40.75 £ 

ATOM 1138 NE2 GLN A 126 44.015 -2.952 16.565 1.00 71.74 ^ 

5 ATOM 1139 N GLY A 127 46.080 1.330 14.270 1.00 28.29 ft 

atom 1140 ca gly a 127 45,527 2,250 13,252 Lu&fl 23.31 £ 

ATOM 1141 C GLY A 127 46.662 3.315 12.953 1.00 22.90 £ 

ATOM 1142 O GLY A 127 47.75? 3.251 13.474 1.0? 25,30 Q 

ATJ2H 1143 y THR A 129 43.311 4.219 12,043 l.QQ 19-51 h 

10 lOm 1144 CA THR A 129 47.149 5.314 11.5 88 1.00 22.12 £ 

ATPM 1145 C THR A 129 47.705 6.219 12.695 1.00 22.60 £ 

ATOM 114g C THR A 129 47.061 6.461 13.731 1.00 IB. 58 Q 

ATOM 1147 CB THR A 128 46.392 6.182 10.54 4 1.00 35.98 £ 

ATOl 1148 OG1 THR A 128 46.533 5.594 9. 239 1.00 58.05 £ 

15 AXOH 1149 CS2 THR A 129 46.942 7.639 10.542 1.00 43.41 £ 

ATPM 1150 p LEV A 12? 48.907 6.715 12.425 1.00 18.32 N 

ATOM 1151 CA LEV A 129 . 49.674 7.534 13.3$6 1.00 16.76 £ 

ATOM 1152 C LEU A 129 49.504 8.959 1 2.967 1.00 4.89 £ 

ATPH 1153 Q LEU A 129 49.232 9.260 11.814 1.00 16.14 £ 

20 ATOM 1154 CB LEU A 129 51.205 7.191 13 .261 1.00 17.91 £ 

A1S21 1155 CG LEU A 129 51.769 5.804 13.752 1.00 18.21 £ 

ATOM 115$ CP1 LEV A 12? 53.132 5.379 13.193 1.00 12.12 £ 

AIPH 1157 CD2 LEU A 129 51.683 5.532 15.251 1.00 3.89 £ 

U59 K <51V A 130 49.913 9.927 13.911 1,0Q 10,23 E 

25 anai 1159 ca clv a 13P 49.912 u.239 13.391 1.00 13,22 £ 

AISM 1130 C GLV A 134 51.128 11.544 12.775 1.00 23.44 £ 

ATOM. 1131 0 GLU A 130 52.249 11.162 13.090 1.00 21.23 Q 

AICH 1162 CB GLU A 130 50.150 11.979 15.035 1.00 18.48 _£ 

ATOM 1163 CG GLU A 130 50.754 13.376 14.886 1.00 77.44 £ 

30 ATOM 1164 CD GLU A 130 49.833 14.328 14.121 1. 00100. OQ £ 

ATOM 1165 OE1 GLU A 130 48.588 14.205 14.340 1.00 36.19 Q 

ATOM 1133 PE2 gLV A 13Q 50,347 15.131 13.295 1,00 21.03 Q 

AIPH 1137 N PRO A 131 50,920 12.219 11.348 1,00 21,35 H 

ATPU 1139 CA PRQ A 131 52.023 12.4 09 10.731 1.00 14.78 £ 

35 &X£M. 1169 C PRO A 131 53.201 13.132 1 1.265 1.00 14.98 £ 

ATOM 1170 O PRO A 131 54.325 12. 847 10.853 1.00 20.99 Q 

ATOM 1171 CB PRO A 131 51.413 13.154 9.552 1.00 14.76 _£ 

AXPH 117? Cg PRO A 131 50,071 1 3 .495 9 .949 1-00 20.99 £ 

ATOM 1173 CD PRO A 131 49.641 12.6 26 11.047 1.00 17.25 £ 

40 AXQH 1174 n THR A 132 52.995 14-095 12.159 1.00 19.77 H 

ATOM 1175 CA THR A 132 54^131 14.838 12.689 1.00 16.44 £ 

ATOM. 117(? c THR A 132 55.102 13.951 13.409 1,00 21.91 £ 

ATOM 1177 O THR A 132 56.317 14.088 13.23 4 1.00 24.17 Q 

ATOM 1178 CB THR A 132 53.716 15.907 13.606 1.00 23.45 £ 

45 ATJ2J 1179 OGl THR A 132 52.976 16.883 12.850 1.00 31.15 Q 
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ATOM 122? . 0 HE A US 5? » 30.2 2.377 14.970 l.oo 13.79 £ 

ATOM 1226 CB ILE A 13B 58.064 4.172 12.316 l.QQ 17.85 £ 

ATOM 1227 CGI TLB A 138 56.6BO 4.473 11.7S7 1.00 28.21 _£ 

ATOM 1228 C<?2 m A 138 58.S74 2.98$ 11. SO? 1,99 9.83 £ 

5 ATOM 1229 CD1 ILE A 138 », W 3.37$ 11,979 1.9.9 18.17 £ 

ATOM 1230 W ALA A 139 60.243 4.361 14.625 1,00 11.54 N 

ATCM 1231 CA ALA A 139 61.494 3,937 15.288 1.00 13.22 _£ 

ATOM 123 2 C ALA A 139 fil,^ 3.364 16.675 1.00 18.73 £ 

ATCM 1233 O ALA A 139 61.791 2.318 17.031 1.00 20.44 , £ 

10 ATOM 123 4 CB ALA A 139 62.434 5.073 15.390 1.00 13.62 _j£ 

ATOM 1235 N LYSA 140 60.397 4.033 17.448 1.00 16.36 H 

AXCM 1236 CA LYS A 140 60.083 3.600 18.815 1.00 15.14 £ 

ATOM 1237 C LYS A 140 59.392 2.262 18.824 1.00 15.18 £ 

AXfiM 1238 Q LYS A HQ $9,824 1,346 19,475 1,99 21.42 Ql 

15 ATOM 1239 CB LYS A 140 59.193 4.606 19.525 1.00 17.86 £ 

ATJ33 1240 CG LYS A 140 59.925 5.806 20.152 1.00 21.11 £ 

ATCM 1241 CD LYS A 140 61.208 5.478 20.958 1.00 16.75 £ 

ATOM 1242 CE LYS A 14p 61.664 6.735 21.835 1.00 10.06 £ 

ATOM 124.3 tTC LYS A 119 52.S88 5.49$ 22,921, l.QQ 14. .49 H 

20 AXQH 1244 N ILE A 141 58.356 2.116 18.027 1.00 11.49 : H 

ATOM 1245 CA ILE A 141 57.703 0. 828 17.977 1.00 17.92 , £ 

ATCM 124$., Q ILE A 141 58, 7?p -0,282 17.977 1,99 13. 4E : £ 

A12M 1247 O ILE A 141 58.730 -1.374 18.148 1.00 13.92 _£ 

ATOM 1248 CB ILE A 141 56.497 0.925 17.019 1.00 22.59 £ 

25 AIOl 1249 CGI ILE A 141 55.466 1,906 17, $57 l f Q0 17, El _ £ 

ATCM 1250 CG2 ILE A 141 55.863 -0.411 16.700 1.00 10.49 £ 

ATCM 1251 CD1 ILE A 141 54.530 2.327 16.449 1.00 13.43 £ 

ATOM 1252 N ALA A 142 5 9.637 0.028 16.650 1.00 10.29 H 

ATCM 1253 CA ALA A 142 60.657 -0.9 31 16.228 1.00 7.15 £ 

30 ATOM 1254 C ALA A 142 61.456 -1.301 17.456 1.00 16.58 £ 

ATCM 1255 O ALA A 142 61.839 -2.454 17.621 1.00 13.04 2 

ATCM 1256 CB ALA A 142 6 1.604 -0.288 15.130 1.00 4.44 £ 

ATCM 1257 N GLY A 143 6 1.703 -0.307 18.316 1.00 9.56 N 

ATCM 1258 CA GLY A 143 62.448 -9,525 19, 5?"* 1.99 5. IS £ 

35 ATOM 1259 C GLY A 143 61.770 -1.555 20.430 1.90 16,36 £ 

ATCM 1260 O GLY A 143 62.39? -2.482 29,967 1,99 14.11 , 2 

ATCM 126 1 N ILE A 144 60.476 -1.418 29.564 1.99 29.33 H 

ATOM 1262 CA ILE A 144 59.725 -2.314 21.407 1.00 15.35 £ 

ATCM 1263 C ILE A 144 , 59.706 -3.732 29. 1.99 19.84 £ 

40 ATCM 1264 O ILE A 144 5 9.836 -4.700 21.608 1.00 17.93 Q 

ATCM 1265 CB ILE A 144 58.317 -1. 819 21.559 1.00 10.60 , £ 

ATCM 1266 CGI ILE A 144 58.311 -0.610 22.516 l.QQ 9.80 £ 

ATCM 1267 CG2 ILE A 144 5 7.410 -2.928 22.122 1.00 9.60 , £ 

ATCM 1268 CD1 ILE A 144 57.022 0.076 22.517 1.00 18.32 ; £ 

45 ATOM 126 9 N LYS A 145 5 9.520 -3.841 19.556 1.00 7.20 U 
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ATOM 1315 CE1 TYR A 150 67.4^7 -R.390 19.27fi i.QQ n. 9 1 £ 

ATOM 1316 CE2 TYR A 150 69.19B -9.682 20.34^ 1 .00 11.10 £ 

ATOM 1317 CZ TYR A 150 68.804 -R.900 19.3^ 6 1.00 20.95 £ 

AX3E 1318 OH TYR A 150 69.739 -8,685 18.333 1.00 27.71 _Q 

5 ATOM 1319 N ASN A 151 63.536 -11.249 23.801 1.00 14.83 - H 

ATOM 1320 CA ASN A 151 62.903 -11.889 24.937 1.00 23.62 £ 

AX2M 1321 C ASN A 151 62.417 -13.244 24.410 1.00 28.53 £ 

ATOM 1322 O ASN A 151 62.630 -14.248 25.072 1.00 25.89 £ 

ATOM 1323 CB ASN A 151 61.655 -11.113 25.41 9 1.00 20.95 £ 

10 AJOM 1324 CG ASN A 151 61.988 -9.867 26.28 4 1.00 15.07 £ 

AX922 1325 QD1 ASN A 151 61.126 -9.020 26.466 1.00 26.72 £ 

anai 1326 np2 asn a isi 63.231 -9.709 26.700 1.00 6.31 h 

AX2M 1327 N ARG A 152 61.731 -13.249 23.259 1.00 19.91 H 

AI34 1326 CA ARG A 152 €1.129 -14.465 22.687 1.00 17.62 £ 

15 AXfiM 1329 C ARG A 152 62.090 -15.523 22.188 1.00 21 .34 £ 

AI2H 1330 O ARG A 152 61.959 -16.687 22 .542 1.00 15.44 Q 

ATPtt 1331 CB ARG A 152 60.086 -14.148 21.610 1.00 15.30 £ 

ATOM 1332 CG ARG A 152 58.672 -13.754 22.157 1.00 17.22 £ 

1333 CP ARG A 152 57.652 -13.297 21. 049 1.00 9.11 £ 

20 ATOM 1334 NE ARG A 152 57.161 -14.419 20.241 1.00 21.05 K 

AX&tt 1335 CZ ARG A 152 57.159 -14.447 18.912 1.00 28.61 £ 

AI2M 1336 NH1 ARG A 152 57.590 -13.387 18.221 1.00 21.98 H 

ATOtt 1337 NH2 ARG A 152 56.717 -15.528 18.262 1.00 26.11 N 

AXS25 -133ft U GLK A 153 63.098-15.104 21.434 1.00 16.54 

25 Alffll 1339 CA GLN A 153 64.044 -16.036 20.842 1.00 9.74 £ 

AlfiM 1340 £ GLN A 153 $$.Qg2 21rP07 l.Op IS. 70 £ 

AIGM 1341 O GLN A 153 65.529 - 17.545 21.763 1.00 24.35 ; £ 

AX2M 1342 CB GLN A 153 64.789 -15.372 19.714 1.00 8.99 £ 

ATOM 1343 CG GLN A 153 65.935 -16.225 19.116 1.00 4.63 £ 

30 ATOM 1344 CD GLN A 153 66.315 -15.637 17.762 1.00 14.17 £ 

ATOM 1345 QE1 GLN A 153 65.611 -14.763 17.254 1.00 12.53 £ 

ATOM 1346 NE2 GLN A 153 67.466 -16. 024 17.228 1.00 13.38 H 

ATOM 1347 N TYR A 154 65.566 -15.518 22,SPg 14,35 M 

AXCM 1348 CA TYR A 154 66.677 -15.839 23.483 1.00 12.16 £ 

35 AI2M 1349 C TYR A 154 66.323 -15.930 24.954 1.00 19.06 £ 

him 1350 O TYR A 154 67.185 -16.207 25.777 1.00 25.59 Q 

ATOM 1351 CB TYR A 154 67.829 -14.816 23.326 1.00 16.89 £ 

ATOM 1352 CG TYR A 154 68.418 -14. 733 21.943 1.00 17.53 £ 

ATOM 1353 CP1 TYR A 154 69.259 -15.726 21.467 1.00 18.91 £ 

40 AXQM 1354 CH2 TYR A 154 68.080 -13.712 21.091 1.00 13.97 £ 

ATOM 1355 CE1 TYR A 154 69.782 -15.686 20.190 1.00 10.98 £ 

ATOM 1356 CE2 TYR A 154 68.621 -13.639 19.806 1.00 23.81 £ 

AXQM 1357 CZ TYR A 154 69.486 -14.634 19.380 1.00 23.08 , _£ 

AXP24 1358 OH TYR A 154 70.002 -1 4.619 18.118 1.00 23.87 £ 

45 AIQM 1359 N GLY A 155 65 . 080 -1 g - 686 25.313 1.00 12.08 H 
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ATOM 1360 CA GLY A 155 64.747 -15.702 26.731 1.00 15.80 £ 

Ktm 13S1 C SI<Y A 1$5 E?,?23 -1M98 27 t 58Q i t Qp ?3.?7 £ 

ATOM 1362 O GLY A 155 65.491 -14.640 28.789 1.00 25.76 Q 

ATOM 1363 N ARG A 156 65.564 -13.318 26.981 1.00 ^5.91 H 

5 ATOM 1364 CA ARG A 156 66.066 -12.146 27.73 4 1.00 14.13 £ 

ATOM 1365 C ARG A 156 64.971 -11.486 28.5B1 1.00 16.23 £ 

AIQM 1336 Q ARC A 1$$ 63,992 '11,91? 29,393 1.99 22,63 2 

AXSM 1367 CB ARE A 156 66.601 -11.124 26.750 l.OO 13.16 £ 

ATOM 1368 CG ARG A 156 67.875 -11.570 26.099 1.00 15.18 £ 

10 AXJ2M 13fi9 CP ARE A 136 69,939 -11.119 27,121 1,Q9 26.12 £ 

AI25 1379 MB.. ARG ft 70.200-11.912 26.633 1.0O21.2S H 

ATOM 1371 CZ ARG A 156 71.092 -12.555 27.386 1 .00 42.25 £ 

ATOM 1372 NH1 ARG A 156 70.870 -12.795 28.679 1.00 20.02 H 

AXSM 1373 NH2 ARG A 156 72.221 -12.966 26.843 1.00 20.88 ft 

15 ATOM 1371 N ASP A 157 65.343 -10.446 29.321 1.00 16.00 H 

MEM 1375 CA ASP A 137 64.370 -9.749 30.166 1.00 16.20 £ 

AXSM 1376 C ASP A 157 64.444 -8.245 29.841 1.00 19.20 £ 

AICM 1377 Q ASP A 157 64.865 -7.429 30.650 1.00 10.71 2 

AIQM 1378 CB ASP A 157 64.609 -10.061 31 .652 1.00 16.50 £ 

20 ATOM 1379 CG ASP A 157 63.4B9 - 9.560 32.566 1.00 26.45 £ 

AlSSa 1399 SRI ASP A 157 62.433 -9.060 32.108 1.00 26.82 Q 

AI2M 1391 QP2 ASP A 137 63.673 -9.653 33.78 4 1.00 21.88 £ 

AXCM 1392 U TYR A ISP 64.038 -7.921 28.620 1.00 19.41 H 

ATOM 1383 CA TYR A 158 64.099 -6.564 28.083 1.00 18.96 C 

- ox iwwrriTir -ttttt gj ttt n i-™- ^^m^^^£^^^^ ^^^^^i — — ■ 

A123 1?8? Q TYR A 159 61.854 -6.296 27.282 1.00 10.12 O 

ATOM 1396 CB TYR A 159 64.562 -6.661 26.631 1.00 16.34 _£ 

ATOM 1387 CG TYR A 158 65.982 -7.166 26 .484 1.00 12.04 £ 

ATOM 1388 CD1 TYR A 158 66.789 -7.415 27.621 1.00 13.76 £ 

30 ATOM 1389 CD2 TYR A 158 66.544 -7.349 25.218 1.00 16.35 £ 

ATOM 1390 CE1 TYR A 158 68.135 -7.786 27.482 1.00 8.18 £ 

ATPM 1391 CE2 TYR A 159 67.886 -7.732 25.060 1.00 13.73 £ 

AI2M 1392 CZ TYR A 158 68.676 -7.942 26. 186 1.00 24.45 £ 

AI2M 1393 OH TYR A 158 69.993 -8.338 25.997 1.00 14.36 _Q 

35 A1QM 1394 N ARG A 159 62.423 -5.200 29.175 1.00 23.53 H 

ATOM 1395 CA ARG A 159 61.105 -4.603 29.483 1.00 21.15 C 

ATSM 1396 C ARG A 159 60.930 -3.172 28 .878 1.00 23.55 £ 

AIPH 1397 Q ARG A 159 61.911 -2.566 28.424 1.00 18.12 £ 

ATOM 1398 CB ARG A 159 60.891 -4.608 31.034 1.00 21.68 \ £ 

40 ATOM 139 9 CG ARG A 159 60.986 -6.029 31.722 1.00 16.41 £ 

ATCW UPQ CP ARE A 159 61.135 -S.952 33,233 1,99 19,19 £ 

ATOM 1401 NE ARG A 159 61.305 -7.402 33.772 1.00 19.25 £ 

ATOM 140 2 CZ ARG A 159 61.164 -7.720 35.058 1.00 36.67 ] £ 

ATOM 1403 NH1 ARG A 159 60.886 -6.776 35.962 1.00 15.32 fi 

45 ATOM 1404 NH2 ARG A 159 61.309 -8.986 35.448 1.00 11.79 H 
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ATOM 1450 C LEU A 166 56 -62? 16.017 31.120 1.00 25.05 £ 

ATOM 1451 O LEU A 166 56.624 16.718 30.125 1.00 25.09 Q 

ATOM 1452 CB LEU A 166 57.460 13.743 30.870 1.00 17.46 £ 

ATOM 1453 CG LEU A 166 57.423 12.218 30.652 1.00 16.63 £ 

5 ATOM 1454 CD1 LEU A 166 58.837 11.639 31.000 1.00 22.52 £ 

ATOM 1455 CD2 LEU A 166 56.336 11.539 31.514 1.00 7.46 £ 

ATOM 1456 W TYR A 167 57.146 16.391 32. 30Q_ 1.00 19.78 H 

ATOM 1457 CA TYR A 167 S 7.678 17.760 32.511 1.00 18.58 £ 

ATOM 1458 C TYR A 167 58.53j 17.763 33 ,767 1,00 15.53 £ 

10 ATOM 1459 O TYR A 167 58.474 16.652 34.575 1.00 16.71 Q 

ATOM 146ft CB TYR A 167 56.509 1 8.778 32.665 1.00 18.33 £ 

ATOM 1461 CG TYR A 167 55.671 18.561 33.931 1.00 14.23 £ 

ATOM 1462 CD1 TYR A 167 5 4-624 17.618 33.977 1.00 13.35 £ 

ATOM 1463 CD2 TYR A 167 55.984 19.258 35.106 1.00 16.52 , £ 

15 ATOM 1464 CE1 TYR A 167 53 .889 17.446 35.146 1.00 21.17 £ 

ATOM 1465 CE2 TYR A 167 5 5.302 19.084 36.264 1.00 8.26 £ 

ATOM 1466 C2 TYR A 167 5 4.22B 18.203 36.296 1.00 23.56 £ 

ATOM 1467 PH TYR A 1^7 53.526 18.Q79 37.504 1.00 22, 83 2 

ATOM 1468 N GLY A 168 59.334 18.797 33.952 1.00 16.59 H 

20 A2P24 1469 gft <?1Y A 168 60.159 19.817 35,152 1,90 19,23 £ 

ATOM 1470 C GLY A 168 61.534 19.428 34.880 1.00 13.69 £ 

ATOM 1471 O GLY A 168 61.746 20.028 33.837 1.00 16.52 £ 

ATOM 1472 W PRO A 369 62.473 19.263 35,817 1.00 20.33 If 

ATOM 1473 CA PRO A 169 63.801 19.822 35.656 1.00 16.07 £ 

^^WWKriTlT^lW'il 1T 1' I ' RRO-A»>hf~ § | L4^30^^9^53^3.4^38,7^^^n■ m-i.q r > 

AXOfl 147$ p pro a 64,305 18,186 33.981 1.00 21.23 £ 

ATOM 1476 CB PRO A 169 64.595 19.206 36.805 1.00 17.28 £ 

ATOM 1477 CP PRQ a 269 63,649 19,919 37.930 1,00 19,8? £ 

ATOM 1478 CP PRO A 169 62.263 18.7 72 37.189 1.00 22.47 £ 

30 ATOM 1479 N HIS A 170 6 5.226 20.235 33.829 1.00 19.48 M 

ATOM 1480 CA HIS A 170 65.952 19.877 32.638 1.00 25.56 £ 

ATOM 1481 C HIS A 170 65.096 19.707 31.428 1.00 29.15 £ 

ATOM 1482 O HIS A 170 65.553 19.091 3C.479 1.00 29.71 \ Q 

ATOM 1493 CB HIS A 170 66.793 19,600 32.845 1.00 28,94 £ 

35 ATOM 1484 CG, HIS A 170 67.703 18.671 34.034 1.00 33.88 £ 

ATOM 1485 ND1 HIS A 170 68.975 19.203 33.969 1.00 25.46 H 

ATOM 1486 CD2 HIS A 170 67.518 18.298 35.326 1.00 34.77 £ 

ATOM 1487 CE1 HIS A 170 69.531 19.151 35.166 1.00 25>63 £ 

ATOM 1488 NE2 HIS A 170 68.673 18.603 36.008 1.00 31.72 H 

40 ATOM 1489 N ASP A 171 63.881 20.245 31.440 1.00 21.52 H 

ATOM 1490 CA ASP A 171 63,041 20.267 30,219 I1PP 28,63 . £ 

ATOM 149 1 C ASP A 171 63.630 21.459 29.359 1.00 41.94 £ 

ATOM 1492 O ASP A 171 64.534 22.17 1 29.835 1.00 29.69 £ 

ATOM 1493 CB ASP A 171 61.552 20.558 30.602 1.00 26.40 £ 

45 ATOM 1494 CG ASP A 171 60.552 20.097 29.540 1.00 22.32 £ 
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ATOM 1495 ODl ASP A 171 60.890 20.067 28.325 1.0O 32.03 Q 

ATOM 1496 OD2 ASP A 171 59.427 19.719 79.916 1.00 42.13 Q 

ATOM 1497 N ASM A 172 63.141 21.712 28.137 1.00 42.08 

ATOM 1498 CA ASN A 172 63.616 22.693 27.388 1.00 35. £ 

5 ATOM 1499 C ASN A 172 62.665 24,056 27.674 .1,00 33.73 £ 

ATOM 1500 O ASN A 172 61.586 24.102 27.104 1.00 32.69 £ 

at cm 1501 cb asn a 172 63.632 22.667 25.869 1.00 gJLufifl £ 

ATOM 1502 CG ASN A 172 63.807 23.987 25.086 1.00 39.09 £ 

ATOM 1503 OPI ASN A 172 62.973 21.347 24.259 1.00 83.94 Q 

10 ATOM 1504 ND2 ASN A 172 64.855 24.740 25.418 1.00 65.07 N 

ATOM 1?Q5 N EBE A 173 63,921 24,953 29.563 1.00 31.93 U 

ATOM 1506 CA PHE A 173 62.082 26.030 28.944 1.00 48.24 C_ 

AIQM 1507 c PHE A 173 $1,99? 27,260 28.045 1.00 69.03 £ 

ATOM 1508 Q PHE A 173 62.27ft 28.395 28.465 1.00 58.79 £ 

15 ATOM 1509 CB PHE A 173 62.225 26.459 30.390 1.00 43.43 £ 

ATOM 1510 CG PHE A 173 61.867 25.399 31.356 1.00 34.19 £ 

ATOM 1511 CD1 PHE A 173 62.810 24.488 31.751 1.00 24.68 £ 

Aiai 1512 CD? PHE A 173 60. $21 25.354 31.925 1.00 24,94 £ 

ATOM 1513 £E1 PHE A 173 62.524 23.549 32.682 1.00 23.64 £ 

20 ATOM 1514 CE2 PHE A 173 60.305 24.366 32.804 1.00 31.32 £ 

ATOM 1515 CZ PHE A 173 61.263 23.457 33.192 l.QQ 24.30 £ 

ATOM 1516 N HIS A 174 61.510 27.036 26.831 1.00 68.16 H 

ATOM 1517 CA HIS A 174 6 1.401 28.109 25.871 1.00 64.53 £ 

ATOM 1518 C HIS A 174 59.973 28.221 25.400 1.00 71.58 £ 

25 AXQM 1519 O HIS A 174 59.309 27.186 25.249 1.00 73.20 £ 

ATOM 1520 CB HIS A 174 62.418 27.870 24.736 1.00 71.71 £ 

ATOM 1521 CG HIS A 174 63.83S 27.8 68 25.229 1.00 92.29 £ 

ATOM 1522 ND1 HIS A 174 64 .921 27.539 24.440 1.OO1O0.0O H 

ATOM 1523 CD2 HIS A 174 64.338 28.133 26.463 1.00100.00 £ 

30 ATOM 1524 CE1 HIS A 174 66.032 27.628 25.160 1.00100.00 £ 

ATOM 1525 NE2 HIS A 174 65.705 27.981 26.393 1.00100.00 U 

ATOM 1526 N PRO A 175 59.469 29.461 25.262 1-00 65,73 U 

ATOM 1527 CA PRO A 175 58.109 29.658 24.770 1.00 55.72 £ 

ATOM 1528 C PRO A 175 S B. 233 29.297 23.267 1.00 75.83 £ 

35 ATOM 1529 O PRO A 175 57.224 29.226 22.554 1.00 69.59 P_ 

ATOM 1530 CB PRO A 175 57.866 31.142 25.0?6 1-OQ 49-14 £ 

ATOM 1531 CG PRO A 175 59.258 31.790 24.901 1.00 42.23 £ 

ATOM 1532 CD PRO A- 175 6 0.286 30.695 25.109 1.00 49.59 £ 

ATOM 1533 N SER A 176 5 9.480 28.954 22.879 1.00 85.09 H 

40 ATOM 1534 ca SEA A 17$ 59.954 29,474 21.54S 1,00 81,19 £ 

ATOM 1535 C SER A 176 59.660 26.965 21 ■ 343 " 1. 00 73 . 90 £ 

ATOM 1536 O SER A 176 59.617 26.459 20.213 1,00 57-03 ; Q 

ATOM 1537 CB SER A 176 61.493 28 .666 21.447 1.00 71.32 £ 

AIQH 1538 QQ SER A 176 6g t Q48 29.349 22,579 1-00 51.93 £ 

45 ATOM 1539 N ASN A 177 59.520 26. 276 22.480 1.00 66.23 H 
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AX01L 1549 CA ASN A 177, 

AIM 1541 C asn a 177 



59.274 24,947 22.619 liOO 56,41 

57.810 24 .497 22.353 1.00 60.91 



ATOM 1542 O ASN A 111 



56.914 25i215 22,911 1.00 55.59 



ATOM 1543 CB ASN A 177 



59.619 24. 469 24.065 1.00 50.45 



AIPH 1544 Cg ASK A 122 



59.562 22.970 24,319 1.00 66.57 



ATOM 1545 QD1 ASN A 177 



59.095 22.216 23.476 1.001O0.00 



ATOM 1546 NP2 ASN A 177 



60.099 22.546 25.464 1.00 35.61 



A TOM 

ATOM 



1547 H SER A 179 



57.583 23.3B7 21.627 1.00 57.10 



1549 



10 



ATOM 1549 



CA SER A 178 

£ SEE ..A 179 



56.234 22.853 21.279 1.00 50.50 



55.557 22.159 22.491. J^flfl 76. &4 



ATOM 1550 o SER A 179 



54.575 21.4QQ 22.304 1,00 99,63 



ATOM 1551 CB SER A 178 



56.316 21.800 20.118 1.00 10.^7 



AXCM 1552 ,.gg SER A 179 



57.397 22.112 19.217 1.00 71.69 



ATOM 1553 N HIS A 179 



56.134 22.284 23.694 1.00 37.39 



15 



Aiai 1554 CA HIS A 179 



55.569 21. 587 24.855 1.00 30.96 



AXSM 1555 C HIS A 179. 



54.961 22.616 25.767 IrOO 21.93 



ATOM 1556 O HIS A 111 



S5.641 23.598 26.138 1.00 25.17 



ATOM 1557 CP HIS A 179 



5 6,634 , 20,693 25.575 1,00 3^,20 



AT9M 1559 Cg ■ HIS A 179 



56.973 19.419 24.935 1-00 42.90 



20 



AIQtt 1559 mi HIS A 179 



56.973 19. 335 23.45 7 1.00 49.52 



ATOM 1560 CD2 HIS A 179 



S71323 18.190 25.278 1.00 52.42 



ATOM 1561 CE1 HIS A 179 



57,283 18, 109 23.QB4 l-Qp 44,78 



ATOM 1562 NE2 HIS A 179 



57.500 17.393 24.168 1.00 50.49 



AE2M 1563 N VAI A 190 



53.661 22.454 26.039 1 T QQ 19,14 



30 



35 



40 



ATOM 1583 CG PRO A 182 



57.925 22.473 28.471 1.00 25.77 



45 ATOM 1584 g P PR0 A 182 



56.727 22.359 29.401 1.00 18.23 
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19.55 


P 


ATOM 


1567 


CB ■ 


VAL A 180 


51.403 


23.115 


26.914 


1.00 


35.47 


,C 


ATOM 


1569 


CGI 


VAL A 180 


50 j 630 


24,399 


27.217 


1,00 


35-94 


c 


ATOM 


1569 


CG2. 


VAL A 190 


50*923 


22.550 


25.663 


1.00 


36.11 


c 


AKW 


1570 


VL 


HE A 181 


537694 


22,935 


29.005 


ItOO 


26.57 




ATOM 


1571 


CA 


ILE A 191 


54.139 


23.285 


30,360 


1,00 


24,49 


c 


ATOM 


. 1572. 


C 


ILE A 181 


55.371 


. 24.213 


30,361.. 


1,00 


16.51 


c 


ATOM 


1573 


.0 


ILE A 181 


55.326 


25.315 


30.909 


1.00 


24.42 


p 


ATOM 


1574 


CB 


ILE A 181 


54.285 


22.018 


31.264 


1.00 


20.20 


C 


ATPM 


1575 


CGI II»EA_1B1 . 


52.979 


21.429 


31.529 


1.00 


18.22 


C 


ATOM 


1576 


CS2 


HE A 161 


55.014 


22.315 


32,591 


1.00 13.37 


c 


ATPM 


1577 


QVl 


ILE A 181 


52.867 


20.086 


32.286 


1.00 


8.03 


c 


ATOM 


157?. 


K. . 


PRP A 192 


56.452 


23.779 


29.718 


1.00 


22.21 


N 


ATPM 


1579 


CA 


PRP A 192 


57.664 


^4.605 


29,640 


IrOO 


22,07 


C 


ATPM 


1580 


C. 


PRP A 192 - 


57.37?. 


25.852 


28.828 


1,00 


24.16 


c 


ATPM . 


1591 


P 


PRP A 192 . 


57,911 


26.949 


29,210, 


1.00 


18.35 


P 


ATOM 


1582 


CB 


PRO A 182 


59,69? 


23.725 


29,990 


1,00 24.97 


c 
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AX£H llflj .H ALA A 193 Sfi.6?8 25.707 71.170 i.qq ?1 ,45 u 

ATOM 1586 CA AIA A 183 56.261 26.896 26.9 43 1.00 21.66 C 

AXOM 1597 Q ALA A 193 55.464 27,900 27.811 1.00 26.10 £ 

ATOM 1588 O AIA A 183 55.773 29.091 27.856 1.00 10.50 £ 

5 ATOM 1589 CB ALA A 183 55.473 26.513 25.703 1.00 13.26 C 

ATOM 1590 N LEV A 181 54.472 27.389 28.543 1.00 2^.34 H 

AlflM 1531 CA LEV A Ifii 53.642 28.215 29.401 1.00 19.05 £ 

ATOM 1592 C LEU A 184 54.312 2B. 693 30.655 1.00 21.91 £ 

AXGU LEV A 194 54.017 29.771 31.158 1.00 19.71 a 

10 ATOM 1594 CB LEU A 184 52.309 27.553 29.715 1.0Q 14.41 £ 

ATOM 1595 CG LEU A 184 51.342 27.595 28.525 1.00 23.42 £ 

AXQM 159$ CPl LEV A ISA 49.918 27.244 28.928 1.00 31.06 £ 

ATOM 1597 CP2 LEV A 191 51.380 28.896 27.690 1.00 21.73 £ 

Eta* — E LEV A 195 55.178 27.879 31.213 1.00 18.39 K 

15 AICM 1599 CA LEU A 185 55.833 28.332 32.417 1.00 16.39 £ 

AX0U 1$0P C LEV A 195 56.669 29. 528 31.985 1.00 23.67 £ 

AIPU Ififil Q LEV A 195 56.681 30.590 32.644 1.00 29.38 O 

ATOM 1602 CP LEV A 195 56.723 27.233 33.015 1.00 15.05 £ 

ATOM 1603 CG LEU A 185 56.021 26 .348 34.041 1.00 15.56 £ 

20 aiCM IfiPl CPl LEV A 195 , 56.819 25.022 34.301 1.00 21 T 06 £ 

Km 16Q5 CP2 LEV A 195 55.722 27. 113 35.321 1.00 11.02 £ 

AXOf 16P6 W ARg A 196 57.337 29.397 30.952 1,00 17,09 U 

AlfiM 1607 CA ARg A 196 58.137 30.523 30.4 29 1.00 18.82 £ 

AXfitt 1609 C ARC A 196 52,309 31>752 30.069 . 1.00 29..00 £ 

25 M2M 16Q9 ,.Q ARG A 186 57.629 32.880 30.476 1.00 23.93 Q 

AIPH 1610 CB ARG A 186 59.026 30.146 29. 281 1.00 22.06 £ 

AIPH 1611 C<? ARg A 1?6 59.653 31. 365 28.652 1.00 38.46 £ 

AI9H 1612 CP Afig A 196 €0.825 31.804 29.462 1.00 83.66 

AISM 1613 NE ARG A 186 62.012 31.861 28.631 1.00 70. 77 N 

30 ATOM 1614 CZ ARG A 186 63.058 32.622 28.904 1.00 91.68 £ 

&T.OM 1615 NH1 ARG A 18 6 63.053 33.386 29.995 1.00 56.56 H 

AXQM 1616 NH2 ARg A 196 64,099 32,639 29,092 1.QP1QP.0Q U 

ATOM 1617 W ARg ?l 19.7 56.234 31.544 2 9. 310 1.00 20.?6 U 

A1PH 1619 CA ARg A 197 55.361 32.662 28.941 1.00 19.32 £ 

35 ATOM 1619 C ARg A 187 54.765 33.453 30.142 1.00 28.41 £ 

AXflM 1620 O ARg A 197 54.823 34.700 30.193 1.00 17.23 Q 

AIGK 1621 CP ARg A 197 54-270 32-223 27.957 . lrOQ 17-Q5 £ 

AIPH 1622 CG ARG A 187 54.813 31. 546 26.720 1.00 61.42 £ 

ATPM 1623 CP ARg A 197 53.696 31.244 25-757 1-PQ 44.57 £ 

40 ATPM 1S24 HE ARg A 197 53,03? 32t472 25t354 1-Qp 29.47 K 

ATOM 1625 CZ ARG A 187 51.831 32.534 24.790 1.00 17.82 £ 

ATOM 1626 NH1 ARG A 187 51.136 31.427 24.544 1.00 24.95 & 

ATOM 1627 WH2 ARG A 187 51.341 33.716 24.447 1.00 37.77 & 

AXPtf 1628 N PHE A 188 54.192 32.734 31.101 1.00 23.48 H 

45 ATPM 1629 CA PHE A 199 53.604 33.399 32.259 1.00 21.24 £ 
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ATOM 1630 C PHE A 18fl 54 . 63B 34 T 0fl0 33.095 1 . OQ 91.39 C 

AXfiM 1631, Q PHE A IBB 54.394 35.126 33.626 1 .00 93.90 Q 

him 1632 CB PHE A IBB 52.723 39.466 33.077 1 .00 19.95 £ 

ATOM 1633 CG PHE A IBB 51.3B9 32.215 32.435 1.00 22. 2B £ 

5 him 1634 CD1 PHE A 188 50.440 3 3.229 32.375 3.00 19.42 £ 

ATOM 1635 CD2 PHE A IBB 51.144 31.038 31.734 1.00 23.82 \ £ 

him 1636 CE1 PHE A 168 49.191 33.026 31.742 1.00 24.77 £ 

ATOM 1637 CE2 PHE A 188 49.936 30 .826 31.057 1.00 20.17 £ 

ATOM 1638 CZ PHE A 188 48.945 31.815 31.068 1.00 23.14 £ 

10 ATOM 1639 N HIS A 189 55.831 33.513 33.118 1.00 24.15 ft 

him 1640 CA HIS A 189 56.933 34.122 33.837 1.00 28.79 £ 

him IMJ £ HIS A 189 57.303 35.506 3 3.315 1.00 2B.58 £ 

him 1642 Q HIS A 189 57.480 36.463 34.083 1.00 20.07 £ 

him 1643 CB HIS A 189 58.148 33.268 33.641 1.00 31.36 £ 

15 him 1644 CG HIS A 189 59.364 33.844 34.290 1.00 29.98 £ 

him 1645 ND1 HIS A 189 59.548 33.833 35.658 1.00 31.00 H 

him 1646 CD2 HIS A 189 60.449 34.464 33.766 1.00 21.79 £ 

him 1647 CE1 HIS A 189 60.722 34.371 35.94 5 1.00 24.04 £ 

him 1648 NE2 HIS A 189 61.257 34.815 34 ,821 1.00 19.53 H 

20 him 1649 y GLU A 190 57.539 35.561 32.006 1.00 28.43 H 

him 1659 CA ELV A 199 57,976 36.916 31.324 1,99 27.72 £ 

him mi C GLU A 190 56.725 37.829 31.437 1.00 32.56 £ 

ATOM 1652 O GLU A 190 5 6.949 38.995 31.717 1.00 27.06 £ 

him 1653 CB GLU A 190 58.122 36.529 29.849 1.00 28.55 £ 

? % M a r* ,rn M l$54—<*S—GLV-K~1 Qn - ^ 9^1>5.0^r35^6q~2^61^~1^00^35,,2>9 . - = ^£ 1 

him 1655 CD GLU A 190 60.553 35.941 29.892 1.00 99.81 C 

him 1656 OE1 GLU A 190 60.913 36.037 31.085 1.00 86.56 Q 

him 1657 OE2 GLU A 190 61.293 36.167 28.910 1.00100.00 Q 

him ISSi W ALA a 191 $$,493 37,39; 31.196 1.99 32.67 H 

30 him 1659 CA ALA A 191 54.349 38.286 31.311 1.00 25.30 £ 

him 1669 C ALA A 191 54.287 38.795 32 .742 1.00 36.20 £ 

him 1661 O ALA A 191 53.920 39.924 33.014 1.00 27.52 Q 

him 1662 CB ALA A 191 53,955 37,563 31.990 1,90 16,48 £ 

him 166? y THR A 19? 54,549 ?7 t ?27 33.693 1.99 29t39 M 

35 him 1664 CA THR A 192 54.395 38.386 35.041 1.00 19.08 £ 

him 1665 C THR A 192 55.420 39.494 35.298 1.00 44.78 £ 

him 1666 O THR A 192 55.094 40.550 35.839 1.00 40.58 £ 

him 1667 CS THR A 192 54.515 37.235 3?. 983 1.99 19.99 £ 

him 1668 QG1 THR A 192 53.410 36.348 35.755 1.00 34.36 £ 

40 ATOM 1669 CG2 THR A 192 54.461 37. 738 37.425 1.00 21.15 £ 

him 1679 y ALA A 193 56.617 39.312 34.757 1.99 49.5? U 

ATOM 1671 CA ALA A 193 57.705 40.2 86 34.905 1.00 50.59 _£ 

him 1672 C ALA A 193 57.496 41,613 34.145 g r 9P 54.42 £ 

him 1673 o ALA A 193 57.952 42.699 34.553 1,99 49.29 Q 

45 ATOM 1674 CB ALA A 193 59.047 39.6 40 34.496 1.00 51.78 £ 
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ATOM 1675 N GLN A 194 56.810 41.530 31.D 22 1.00 4? r 1fi 

ATOM 1676 CA, gUL A ISA 56.566 42.722 17.1 42 1.00 38.03 £ 

AIPM 1^77 C gLW A 194 55.264 43.389 32.576 1.00 40.8* £ 

ATOM 1678 O giM A 19.4 54.830 44.284 31.845 1.00 S1.?0 £ 

5 ATOM 1679 CB GLN A 194 56.599 42.358 30.750 1.00 35.96 £ 

ATOM 1680 CG GLN A 194 57.910 41.692 30.290 1.00100.00 £ 

ATOM 1681 CD GLN A 194 57.715 40.661 29.158 1.00100.00 £ 

ATOM 1682 OE1 GLN A 194 56.619 40.546 28.579 1.00100.00 Q 

AI2M 1S83 NE2 <*W A 191 58.782 39.904 28.848 1.00100.00 W 

10 ATOM 1684 N GLY A 19? 54.583 42.949 33.630 1.00 32.29 K 

ATOM 1685 CA GLY A 195 53.236 43.464 33.864 1.00 36.26 £ 

ATOM 168 6 C GLY A 195 52.299 43.332 32.593 1.00 45.33 £ 

ATOM 1687 O GLY A 195 51.515 44.242 32.346 1.00 45.16 £ 

ATOM 1688 N GLY A 196 52.405 42.245 31.78 8 1.00 36.33 H 

15 ATOM 1689 CA GLY A 196 51.515 41.965 30.608 1.00 19.06 £ 

ATOM 1690 C GLY A 196 50.037 41.95B 31.117 1.00 22.49 C 

ATOM 1691 O GLY A 196 49.724 41.479 32.223 1.00 33.09 _Q 

ATOM 1692 N PRO A 197 49.144 42.657 30.4 31 1.00 29.22 H 

ATOM 1693 CA PRO A 197 47.790 42.732 30.953 1.00 25.29 £ 

20 ATOM 1694 C PRO A 197 47.091 41 .413 30.6 74 1.00 24.64 £ 

ATOM 1695 O PRO A 197 46.192 40.991 31.411 1.00 24.75 £ 

ATOM ISPS CB PRQ A 197 47.162 43.911 30.176 1.00 26.31 £ 

ATOM 1697 CG PRO A 197 48.188 44.407 29.252 1.00 26.56 £ 

ATOM 1698 CD PRO A 197 49.307 43.454 29.203 1.00 30.25 £ 

25 ATPtt 1S99 W ASP A 199 47t?72 ^ Q . 723 2?t ^ p fi 

ATOM 1700 CA ASP A 198 47.067 39. 418 29.405 1.00 21.65 £ 

ATOM 1701 C ASP A 199 48.046 38.522 28.677 1.00 31.28 £ 

ATOM 1702 O ASP A 198 49.062 38.978 28.172 1.00 34.57 _£ 

ATOM 1703 CB ASP A 198 45.739 39.507 28.669 1.00 32.80 £ 

30 ATOM 1704 CG ASP A 198 45.868 40.055 27.256 1.00 46.13 _£ 

ATOM 1705 OD1 ASP A 198 46.982 40.230 26.725 1.00 57.45 Q 

ATOM 1706 OD2 ASP A 198 44.817 40.271 26.640 1.00 67.61 £ 

ATOM 1707 N VAL A 199 47.713 37.234 28.614 1.00 38.67 H 

ATOM 1708 CA VAL A 199 48.499 36.226 27.901 1.00 27.79 £ 

35 ATOM 1709 C VAL A 199 47 .462 35.469 27.065 1.00 25.88 £ 

ATOM 1710 O VAL A 1 99 46. 460 35.023 27.598 1.00 24.22 O 

ATOM 1711 CB VAL A 199 49.163 35.229 28.905 1.00 24.37 £ 

ATOM 1712 CGI VAL A 199 49.874 34.047 28.160 1.00 20.28 £ 

ATOM 1713 CG2 VAL A 199 50.121 35. 942 29.835 1.00 22.25 £ 

40 ATOM 1714 N VAL A 200 47.661 35 .386 25.757 1.00 23.72 H 

ATOM 1715 CA VAL A 200 46.701 34.694 24.903 1.00 23.99 £ 

ATPtf im C VAL A 209 47,157 33.296 21,199 l.QQ 22.85 £ 

ATOM 1717 O VAL A 200 4 8.321 33.108 24.188 1.00 29.77 £ 

ATOM 1718 CB VAL A 200 46.358 35. 548 23.680 1.00 23.11 £ 

45 ATOM 1719 CGI VAL A 200 45.561 34.737 22.596 1.00 16.25 £ 
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ATOM 1720 CG2 VAL A 200 45.652 36.823 24.130 1.00 27.86 £ 

ATOM 1721 N VAL A 201 46.296 ?2.278 24.632 1.00 27.39 H 

ATOM 1722 CA VAL A 201 46.588 30.893 24.265 1.00 9.63 £ 

ATOM 1723 C VAL A 201 45.653 30.529 23.165 1.00 19.63 £ 

5 ATOM 172 4 O VAL A 201 44.452 30.755 23.312 1.00 17.61 Q 

ATOM 1725 CB VAL A 201 46.306 29.952 25.426 1.00 19.95 £ 

ATOM 1726 CGI VAL A 201 46.703 28.519 25.054 1.00 20.85 £ 

ATOM 1727 C?2 VAL A 201 47.096 30.439 26.661 1.00 IE, 73 £ 

ATOM 1729 N TRP A 202 46.210 30.080 22.030 1.00 14.36 

10 ATOM 1729 CA TRP A 202 4 5.422 29.693 20.865 1.00 18.97 £ 

AlflM 1739 C TRP A 202 44.495 29,572 21.313 ULfl 36,22 £ 

ATOM 1731 £2 TRP A 202 44.934 27.694 22.057 1.00 31.46 Q 

AXOl 1732 CB TRP A 202 46.292 29.055 19 .823 1.00 19.14 £ 

1733 CG TRP A 202 47.243 29.894 19.066 1.00 33.65 £ 

15 Alffll 1734 CD1 TRP A 202 48.391 29.463 1B.429 1.00 35.28 C 

ATOM 1735 CD2 TRP A 202 47.126 31.282 18.772 1.00 39.90 £ 

AIOl 1736 NE1 TRP A 202 48.941 30.481 17.693 1.00 37.86 H 

AXCM 1737 CE2 TRP A 202 49.229 31. 624 17.922 1.00 39.35 £ 

AX2H 1738 CE3 TRP A 202 46.206 32.281 19.138 1.00 39.39 £ 

20 ATOM 1739 CZ2 TRP A 202 48.380 32.884 17.367 1.00 36.15 £ 

hXm 1740 C23 TRP A 202 46.356 33.542 18.57B 1.00 39.60 _£ 

AlflM 1741 CH2 TRP A 202 47.428 33.828 17.684 1.00 40.99 £ 

ATSH 1742 W C1Y A 203 43.245 29.564 20.942 1.00 25.59 M 

A3SM 1743 CA GLY A 2Q3 42.332 27.483 21.169 ItQQ 13.09 £ 

l*Mmi\7~M WM ~ GfrY^Aw203 M M M ^4>l^ 

Aiai 1745 o G LY A 203 41.340 29.915 22,996 l t Q0 22-96 £ 

ATOM 1746 N SER A 204 40.270 26.919 22.262 1.00 16.88 H 

him 1747 ca SER A 204 39,163 29.979 23.192 1.00 18.39 £ 

ATOM 1748 C SER A 204 39.561 26.664 24.659 1.00 22.07 £ 

30 ATOM 1749 O SER A 204 38.888 27.096 25.604 1.00 34.39 Q 

ATOM 1750 CB SER A 204 39,053 25.999 22,740 1 , 0Q 9.99 £ 

ATOM 1751 OG SER A 204 38.237 24.695 23.291 1.00 16.37 £ 

ATOM 1752 N GLY A 205 40.562 25.813 24.654 1.00 12.42 H 

ATOM 1753 CA GLY A 205 40.963 25.411 26.208 1.00 11.64 , £ 

35 ATOM 1754 C GLY A 205 40.208 24.178 26.711 1.00 19.49 £ 

ATOM 1755 O GLY A 205 40.422 23.723 27.838 1.00 13.59 £ 

ATOM 1756 W THR A 206 39.292 23.683 25.881 1.00 15.38 H 

ATOM 1757 CA THR A 206 38.432 22.594 26.281 1.00 10.80 £ 

ATOM 1758 C THR A 206 39.056 21.221 26.154 1.00 26.39 £ 

40 ATOM 1759 O THR A 206 38.564 20.267 26.737 1.00 23.28 £ 

mm 1790 CB THR A 206 37.124 22.562 25.490 1.00 12.96 : £ 

ATOM 1761 OG1 THR A 206 37.438 22.395 24.082 l.QQ 13.12 Q 

ATOM 176 2 CG2 THR A 206 36.348 23.840 25.620 1,00 10.62 £ 

ATOM 176 3 W PRO A 207 40.101 21.083 25.354 1.00 21.10 H 

45 ATOM 176 4 CA PRO A 207 40.65B 19.743 25.175 l.QQ 19.15 £ 
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41.31$ 19,191 2€,123 



.00 21.7? 



Si 
H 

Ji 

Ji 
Si 



ATOM 17 66 O PRO A 207 



41.951 19.925 27.215 



-00 20.65 



AXJ21 1767 CB PRO A 207 



41.638 19.909 24.013 



90 17.51 



A3 



1768 W PR? A 207 



41.146 21.213 23.307 



.00 21,4? 



AXQH 17 69 CP PRO A 207 



40.698 22.062 24.431 



00 23.44 



AXOH 1770 H MET A 208 



41.112 17,876 26.624 



.00 15.60 



10 



15 



ATPM 1771 CA MET A 209 



ATOM 
AIOL 



41.694 17.167 27.775 



.00 22.94 



him 1772 C MET A 209 



43.058 16.427 27.579 



.00 21.90 



bim 1773 O MET A 208 



43.248 15.677 26.633 



OQ 23.16 



ATOM 1774 CB MET A 208 



40.645 16. 273 28.386 



•00 32.86 



177? Cg MET A 209 



39. MO 17.057 29.223 



.00 46.17 



1776 SD MET A 208 



39,301 15.990 29.926 



.00 57.95 



ATOM 1777 CE MET A 208 



37.999 15.028 28.343 



.00 59.23 



AI2M 1779 H ARg A ,209. 



44,922 16.691 29.456 



00 17. IS 



ATOM 1779 CA ARG A 209 



45.318 16.042 28.324 



00 19.99 



ATOM 1760 C ARG A 209 



45.971 15,534 29.639 



.00 16.92 



ATOM 
ATOM 



1791 0 ARE A 209 



45.433 15.946 30.697 



.00 16,59 



1782 CB ARG A 209 



46,340 16.963 27.659. 



.00 21.07, 



A12S1 1783 CG ARG A 209 



45.990 17.479 26.275 



.00 22,57 



20 



ATOM 1784 CD ARG A 209 



45.833 16.357 25.282 



■00 28.26 



Si 

H 
Ji 
Ji 
Si 
Si 
_& 

.£ 

SI 
SI 
JS 
Si 
Si 

Si 
Si 
Si 
j£ 
Si 
Si 
£. 



AXGM 1785 NE ARG A 209 



45.586 16.819 23.906 



00 23.15 



A3 



1796 CZ ARg A 209 



44.420 16.742 23.267 



00 34.52 



AlflM 1787 NHl ARG A 209 



43.336 16. 267 23.890 



.00 18.03 



25 



A1SS1. 
AXGH. 



17SB KH2 ARG A 209 



44.339 17.175 22.012 



.00 29.79 



1799 M gLV A 210 



AISM 1790 CA GLU A 210 



46.978 14.675 29.547 



■00 20.97 



47.530 14.079 30.720 



,00 17.37 



ATOM 1791 C gLV A 210 



49.031 14.490 30.951 



-00 20.96 



AXPU 1792 O GLU A 210 



49.748 14.622 29.841 



00 22.44 



1793 CB GLP A 210 



47,400 12,562 30,571 



.00 16.26 



30 A TPK 1794 CG GLU A 210 



47.807 11.785 31.809 



00 19.91 



AI2M 1795 CD GLU A 210 



48.057 10.304 31.531 



.00 27.91 



ATOM 1796 obi gLV A 210 



49.111 9.919 30,343 



.00 17.29 



ATOM 1797 QE2 GLU A 210 



48.268 9.540 32.494 



-00 21-63 



AT9M 1798 W PHE A 211 



49.504. 14.712 32.094 



,00 14.02 



35 



ATOM 1799 CA PHE A 211 



50.997 15,159 32.353 



,00 17.48 



AIQM 1800 C PHE A 211 



51,456 14.414 33.531 



.00 33.62 



AXO* 1801 O PHE A 211 



50.716 14.031 34.443 



,00 27.96 



40 



ATOM 
ATOM 



1802 CB PHE A 211 



50.933 16. 677 32.644 



,00 17.79 



1803 CG PHE A 211 



50.303 17.490 31.541 



.00 21.49 



AISM 1804 CD1 PHE A 211 



51.009 17,676 30.320 



,00 17.36 



ATOM 1805 CD2 PHE A 211 



48.933 17. 844 31.618 



00 15,09 



AIQM 1806 CE1 PHE A 211 



50.399 18.334 29.237 



.00 16,37 



AIOH 1807 

ATOM 1808 



CE2 PHE A 211 

CZ PHE A 211 



48.288 18.491 30.533 



.00 9.61 



49.053 18.756 29.344 



.00 12. 71 



45 



ATOM 1809 W LEU A 212 



52,761 14.161 33.495 



.00 23.76 
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ATOM 1B10 CA LEU A 2^ 



53.405 13.446 34.603 



h3m 1911 C , LEV A 2.1,2 



54.772 14.053 34.B9B 



ATOM IB 12 O LEU A 212 



55.519 14.39B 33.985 



him 1913 CB LEV A 212 



53.54B 11.954 34.294 



ATOM 1814 CG LEU A 212 



54.033 11.039 35.406 



ATOM 1815 CD1 LEU A 212 



52.866 10.634 36.2B0 1 



ATOM 1816 CD2 LEU A 212 



ilJfifi 9.829 34.B32 



ATOM 1917 N HIS A 213 



55.023 14. 302 36.175 



AXQM 1919 CA HIS A 213 



56.290 14,961 36.555 



10 



AISM 191? C HIS A 213 



57.390 13.929 36.233 



him 19,20 ,. P his a 213 



57.238 12. 614 36.542 



AXSM 1821 CB HIS A 213 



5g r 29P . 15.25P 39 .002 



ATQfl lfi22 CG HIS A 213 



57.131 16.017 39. 109 



AlSai 1623 ND1 HIS A 213 



59.703 15,106 39.656 



15 



him 1921 CD2 HIS A 213 



57.716 17.353 39.199 



him 1825 CEl HIS A 213 



59.615 16.331 38.917 



ATOM 1826 NE2 HIS A 213 



59.041 17.523 39.917 



ATOM 1927 W VA1 A 211 



59^.159 



14.295 35.999 



ATOM IfiZfl CA VAX A 211 



59.532.. 13.393 35 ,361 



20 



ATOM U29, C VAL A 211. 



S0 T 0E7 i2 T 5g3 36.551 



ATOM 1830 O VAL A 214 



60.604 11.444 36.359 



ATCM 1931 CP YA1 A 214 



60.625 14.125 34.566 



AI£M 1132 CEl VAL A 214 



61,39P 15.199 35.495 



ATOM 1833 CG2 VAL A 214 



61 t .56P 13 . P97 , 3 3.902. 



37. 7 9 .P. 



25.29 




him 1936, 



ATOM 



1B37 



o 



ASP A 215 



59.53P 11.023 39.230 



■ASP A 215 



59.988 9.981 39.666 



ATOM . 1939.. CB ASP A 215 



60.575 13.129 40.155 



30 



ATOM 1839 CG ASP A 215 



6i'.859 13. 979 40.068 



ATO M 19 . 40 QP1 ASP A 215 



6^782 13.614 39,309 



ATOM 1841 OD2 ASP A 215 



61.957 15.029 40.730 



ATPM 1942, W ASP A 216 



59.276 H.13J? 39.963 



35 



ATPK 1943 

ATOM 1844 



CA ASP A 216 

C ASP A 216 



51 . 37 ? 1Q. 0 1 7 39 .P16 



57..761 9. 093. 37.994 



ATOM 1845 0 ASP A 216 



57.715 7.880 38.026 



ATOM 1846 CB ASP A 216 



55.912 10,457 39.921 



ATOM 1847 CG ASP A 216 



55 T 193 10.757 40,162 



ATOM 1848 OD1 ASP A 216 



55.50? 10,U9 41.223 



40 



ATOM . 1 949. PP2 ASP A 216 



54.249 11.597 40.124 



ATOM 

ATOM 



1950 N MET..A..21L 



58.092 9.653 36.755 



1851 CA MET A 217 



58.394 8.785 35.636 



ATOM 1B52 C MET A 217 



59,572 7t942 35.992 



ATOM 185 3 O MET A 217 



59.579 6t752 35.710 



45 ATOM 1854 CB ME 7 A 217 



58.637 9.592 34.345 



.00 21.24 



.00 14.00 
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rOQ 27.51 
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atom 1855 cg met a 217 59.478 8.918 l.PP 16.37 £ 

ATOM 1856 SP MET A 217 58.962 L1« 3 2i4 73 1.00 30.51 & 

ATOM 1B57 CE MET A 217 57.465 3?. 391 l.Op 1?,.,?7 £ 

ATOM 1658 N ALA A 218 6P.561 £.562 36.623 1.00 19.09 H 

5 atom 1859 ca ALA A 216 61.774 7.841 32.PP2 ■ 1.PP 13.65 £ 

atom 18 60 c ala a 218 61.436 k77B 38.P28 l.PP 22.61 £ 

ATOM 1861 O ALA A 218 61.934 5.670 37-967 1.00 19.36 Q 

atom 1862 cb ala a 218 62.8P9 8»78P 37,579 ItPP 12.2? £ 

atom 1863 N ALA A 219 6P.6P5 7,1P9 39.QPQ l.QQ 19.34 H 

10 ATOM 1864 CA ALA A 219 60.310 6.105 40.023 1.00 18.01 £ 

ATPM 186$ c ALA A 219 59.63P 4.9P1 39.413 l.PP 23. £7 £ 

ATOM 1866 O ALA A 219 59.781 3.777 39.898 1.00 22.71 O 

ATOM 1867 CB ALA A 219 59.387 6.678 41.083 1.00 10.11 £ 

ATOM, 18 6Q N ALA A 2ZQ 58.753 5.174 36.454 1.00 18.99 

15 ATOM 18 69 CA ALA A 220 57.905 4.158 37.855 1.00 14.12 £ 

ATQM XQ7 p £ ALA A 22P 56.753 3.213 37.034 1.00 25.33 £ 

ATOM 1871 q ALA A 22P 58.584 2.PP6 37^114 l f 0P 2P.63 Q 

ATOM 1872 CB ALA A 220 56.796 4.798 37.023 1.00 8.53 £ 

AI2i 1873 W SEE A 221 59,770 ?^72 36.379 l.M 23.92 Ji 

20 ATOM 1874 CA SER A 221 60.702 3.011 35.556 1.00 18.38 £ 

ATOM 1875 £ SER A 221 61.537 1.989 36.353 1.00 20.90 , £ 

ATQM 1876 Q SER A 221 61.68? 0.799 35.983 l.PP 19.84 Q 

A1GM 1877 cb ser a 221 61 . 6P4 3,985 34.8P4 l.PP 1P.67 £ 

atom 1878 og ser a 221 6P.947 4,744 33.357 l.PP 15.61 P 

25 ATOM 1879 N ILE A 222 62.083 2.476 37.463 1.00 18.12 H 

ATOM 1880 CA ILE A 222 62.866 1.644 38.381 1.00 21.56 £ 

ATQM mi c ILE A 222 62.Q2P P. 554 39.P68 l.PO 29. IP ; £ 

ATOM 1882 O ILE A 222 62.504 -0.566 39.307 1.00 19.03 Q 

ATOM 1883 CB ILE A 222 63.467 2.516 39.432 1.00 24.56 £ 

30 ATOM 1884 CGI ILE A 222 64.465 3.473 38.765 1.00 32.13 _£ 

ATOM 1885 CG2 ILE A 222 64.129 1.671 4P.500 l.PP 28.26 £ 

ATOM 1886 CD1 ILE A 222 64.973 4.585 39.649 1.00 15.61 £ 

ATOM 1887 N HIS A 223 60.772 0.907 39.384 1.00 19.34 , fi[ 

ATOM 1888 CA HIS A 223 59.829 -0.031 39.996 1.00 20.46 £ 

35 ATPM 199? £ HIS A 223 59 T 59.9 -1.P97 38.964 ItPP 24.82 £ 

ATOM 1890 O HIS A 223 59.723 -2.283 39.270 1.00 24.66 2 

ATOM 1891 CB HIS A 223 58.46,5, P.637 4P.359 l.QQ 19.53 £ 

ATOM 1892 CG HIS A 223 57.373 -0.333 40.759 1.00 26.64 £ 

ATOM 1B93 ND1 HIS A 223 57.021 -0.564 42.062 1.00 24.16 H 

40 ATOM 1894 CD2 HIS A 223 56.497 -1.062 40.004 1.00 30.39 £ 

ATOM 1895 CE1 HIS A 223 55.983 -1.399 42.112 1.00 30.39 £ 

ATOM 1896 NE2 HIS A 223 55.652 -1.727 40.869 1.00 28.13 H 

atom 18 97 h VAX A 224 59.354 -P.684 37.725 IrPP 22.P6 H 

ATOM 1898 CA VAX A 224 59.111 -1,657 36.652 ItPP 19.1.5 _£ 

45 ATOM 1899 C VM A 224 60.350 -2.490 36.333 1.00 25.89 £ 
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ATPM 1900 P VAL A 224 60.2B2 -3.709 36.250 1.00 22.37 Q 

ATOM 1901 CB VAL A 224 58.559 -1.022 35.377 1.00 22.59 £ 

ATOM 1902 CGI VAL A 224 58.512 -2.050 34.231 1.00 22.61 £ 

ATPM 1903 CS2 VAL A 221 57.161 -0.491 35.650 1.00 23.41 £ 

5 ATOM 1904 N MET A 225 61.499 -1.838 36.255 1.00 27.83 E 

ATOM 1905 CA MET A 225 62.710 -2.577 36.004 1.00 23.69 £ 

ATOM 1906 C MET A 225 62.896 -3.678 37.071 1.00 31.95 £ 

ATOM 1907 O MET A 225 63.290 -4. BOS 36.785 1.00 24.33 £ 

ATPM 1909 , CB , MET A 225 63.902 -1.604.36.056 1,00 21.34 £ 

10 ATPM 1909 W MET A 225 65 » 295 -2.296 35.999 1.00 17.83 £ 

ATOM 1910 SD MET A 225 65.750 -2.958 34.306 1.00 23.33 & 

ATOM 1911 CE, MET A 225 67,090 -1,996 33,795 l.QQ 16.46 £ 

ATOM 1912 H GLU A 226 62.644 -3.319 38 .316 1.00 19.54 H 

ATOM 1913 CA GLU A 226 62.988 -4.161 39.428 1.00 21.58 £ 

15 ATOM 1914 C GLU A 226 61.999 -5.200 39.918 1.00 30.77 _£ 

ATOM 1915 £ GLU A 226 62.308 -6.012 40.780 1.00 29.39 Q 

ATPM 1916. CB <?LV A 226 . 63.613 -3.323 40.547 ItPP 20.47 £ 

ATOM 1917 CG GLU A 226 64.937 -2.673 40.122 1.0O 23.03 C 

ATOM 1918 CD GLU A 226 65.504 -1.809 41.208 1.00 32.62 £ 

20 ATOM 1919 PE1 GLV A 226 64.721 -1.455 42.122 1.00 26.12 Q 

ATOM 1920 OE2 GLU A 226 66.711 -1.479 41.152 1.00 17.67 Q 

ATOM 1921 N LEV A 227 60.937 -5,249 39.295 1.00 34.11 H 

ATOM 1922 CA LEU A 227 59.883 -6.296 39.642 1.00 35.26 £ 

ATOM 1923 C LEU A 227 60.537 -7.644 39 .320 1.00 27.93 £ 

^^ATCM1925Tbi!evA22 

ATOM 1926 C<? LEV A 227 57,391, -5,569 39,955 l.QQ 40.30 £ 

him 1927 CP1 LEV A 227 57,697 -4 f 194 39,392 1-QQ 42.04 £ 

ATOM 1928 CD2 LEU A 227 56.610 -5.577 37.647 1.00 46.21 £ 

30 ATOM 1929 N ALA A 229 60.026 -g.699 39,955 1,00 27.15 tt 

ATOM 1930 CA ALA A 228 60.425 -10.051 39.616 1.00 25.26 £ 

AXPH 1931 C ALA A 228 59.801 -10.435 38.279 1.00 27.93 £ 

ATOM 1932 O ALA A 228 56.624 -10.093 37.934 1.00 31.26 Q 

ATOM 1933 CB ALA A 228 60.003 -11. 052 40.703 1.00 22.05 £ 

35 ATOM 1934 N HIS A 229 60.624 -11.160 37.539 1.00 27.05 fi 

ATPM 1935 PA HI? A 229 6Q,275 -11,605 36.222 1.00 24.4? £ 

ATOM 1936 C HIS A 229 58.905 -12.2 60 36.184 1.00 21.74 £ 

ATPM 1937 o his A 229 59,015 -11,951 35.399 1-QQ 22,22 o 

ATOM 1938 CB HIS A 229 61.351 -12 .520 35.698 1.00 17.71 £ 

40 ATOM 1939 CG HIS A 229 61.284 -12.70 1 34.220 1.00 27.24 £ 

ATOM 1940 ND1 HIS A 229 61.060 -11.650 33.350 1.00 34.38 H 

AIC84 1941 CP2 HIS A 229 61.292 -13.921 33.465 l-QQ 31,45 £ 

ATPM 1942 CE1 HIS A 229 60.992 -12.113 32.115 1.00,30.50 £ 

ATOM 1943 NE2 HIS A 229 61.124 -13.427 32.159 1.00 35.23 fi 

45 ATPM 1944 N. ELV A 230 59.691 -13.161 37.140 1,00 20.24 U 



i 

i! 
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AT CM 1945 CA GLU A 230 57.425 -13.695 31.209 1.00 29.41 C 

ATOM 1946 C GLU A 230 56.181 '13.051 31.341 1.00 22.20 £ 

ATOM 1947 O GLU A 230 55.159 -13.359 36.679 1.00 17.78 

AT CM 1948 CB GLU A 230 57.464 -14.997 38.274 1.00 38.51 £ 

5 ATOM 1949 CG GLU A 230 58.065 -14.582 39.567 1.00 63.09 £ 

ATOM 1950 CD GLU A 230 S7.036 -14.473 40.661 1.00100.00 £ 

ATOM 1951 OEl GLU A 230 55.859 -14.872 40.400 1.00100.00 £ 

AlCtt 1952 PE2 01V A 230 57,109 -14,003 41.7,69 1.P0 91.49 C 

ATOM 1953 N VAX A 231 56.272 -12.004 38.182 1.00 16.53 H 

10 ATOM 1954 CA VAL A 231 55.202 -11.029 38.356 1.00 20.23 £ 

ATOM 1955 C VAL A 231 55.009 -10.164 37.102 1.00 24.45 £ 

aiaa 1956 . o val a 231 53.864 -9.834 36.705 1.0021.00 q 

ATCM 1957 cp VAL a 231 55,541-10.057 39. 426 1,00 20.61 £ 

ATPM 1959 Cgl VAL A 231 54.362 -9.Q99 39.510 1.00 29,79 £ 

15 AXaj 1959 Cg2 m A 231 55.B91 -10.757 40,577 1-.00 29,96 £ 

ATCM 1960 N TOP A 232 56.13? -9,799 35.495 1.00 17.17 H 

A1SM 1961 CA TRP A 232 56.052 -9.044 35.262 1.00 21.52 £ 

AIGM 1952 £ TRP A 232 55.399 -9,944 34.155 1,00 20t53 £ 

ATCM 1963 O TRP A 232 54.588 -9.306 33.380 1.00 24.31 Q 

20 ATCM 1964 CB TRP A 232 57.438 -8.6 44 34.801 1.00 29.88 £ 

AXSM 1955 eg TOP A 232 57.430 '7,943 33.500 1.00 27.55 £ 

AISM 1966 CDl TRP A 232 57.184 - 6.464 33.356 1.00 25.42 £ 

ATOM 1967 CD2 TRP A 232 57.714 -8.3 36 32.169 1.00 27.75 £ 

ATCM. 1969 WEI TOP A 232 ...57.325 -6,095 32,033 1.00 22.5? B 

25 ATCM 1969 CE2 TRP A 232 57.655 -7.203 31.279 1.00 25.11 £ 

AICM 197Q CE3 TOP A 232 58,p?7 - 9,^3 31,640 1.00 22,72 £ 

ATCM 1971 CZ2 TRP A 232 57.917 -7.31 6 29.879 1.00 17.23 £ 

ATCM 1972 C23 TRP A 232 58.238 -9.720 30.223 1.00 25.97 £ 

ATCM 1973 CH2 TRP A 232 58.154 -8.581 29.368 1.00 22.07 £ 

30 ATCM 197.4 1* LEU A 233 55.749 -11.121 34.018 1.00 23.80 ; H 

AISM 1975 OA LEV A 233 55.141 -11.949 32.937 1,00 24.79 £ 

AIGM 1976 C LEU A 233 53.652 -12.118 33.122 1.00 24.51 £ 

ATCM 1977 O LEU A 233 52.865 -12.075 32.163 1.00 28.50 £ 

AT O M 1978 CB LEU A 233 55.765 - 13.348 32.B20 1.00 26.20 £ 

35 ATCM 1979 CG LEU A 233 57.250 -13.505 32.503 1.00 19.39 £ 

ATCM I960 CDl LEU A 233 57.745 -14.850 33.023 1.00 19. 9Q £ 

ATCM 1981 CD2 LEU A 233 57.561 -13.287 31.017 1.00 16.01 £ 

him 1992 K GLU A 234 53.29? - 12.343 34.372 1.00 25.45 N 

AX2M 1983 CA GLU A 234 51.929 -12.523 34.822 1.00 30.04 £ 

40 ATCM 1984 C GLU A 234 51.128 -11. 319 34.367 1.00 35.69 C 

AXQM 191.5 O GLU A 234 49.926 -11.390 34.052 1,00 29.25 Q 

AIGM 1986 CB GLU A 234 52.007 -1 2.468 36.344 1.00 37.30 , £ 

ATCM 1987 CG GLU A 234 50.908 -13.133 37.118 1.00 45.39 £ 

ATOM 1999. CP <?LV A 234 51 , 112 -12,991 39.6Q1 1,00»Q,,00 £ 

45 &XCH 1989 OEl GLU A 234 52.240 - 13.137 39.104 1.00 99.09 Q 
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ATOM 1990 

ATOM 1991 



SE2 OLV A 22± 
Ji AffN A 2?? 



50.211 -12,257 39.211 



51.807 -10.184 34.364 



AI£M 1992 CA ASN A 235 



51.109 -8.996 33.992 



hXm 1993 C ASM A 235 



51.280 -8.494 32.571 



ATOM 1994 O ASN A 235 



50.824 -7. 393 32.25? 



AIOl 1995 CB ASN A 235 



51. 42^ -7.995 34.9B1 



ATOM 1996 Cg ASK a 235 



50,979 -9.197 36.342 



AISM 1997 PP1 ASN A 235 



49.722 -7.882 36.628 



ATCM 1999 NP2 ASW A 235 



51.653 -8.934 37.140 



10 



ATOM 1999 N THR A 236 



51.935 -9.268 31.708 



ATCM 20PP CA 



A 236 



52,1P9 -8,795 3P.344 



A3 



2001 c 



A 236 



51.967 -9.913 29,419 



AlflM 2PP2 Q THR A 236 



51.551 -11.P33 29.995 



ATOM 2PP3 CB, 



A 236 



53.545 -8.306 30.161 



15 



A3 



20Q4 g£l THR A 236 



54.422 -9.325 30.636 



ATOM 2PP5 CC2 THR A 236 



53.9P1 -7.049 31,941 



ATOM 2PP6 N SLH A 237 



52.903 -9.699 29.1Q9 



ATOM 2007 CA GLN A 237 



52.P97 -!Pr793 27.122. 



20 



A3 
A3 



2PP9 



SLH A 237 



53.335 -1P.5P7 26.331 



2PP9 Q SLN A 237 



53.729 -9.362 26.204 



ATOM 2010 CB GLN A 237 



5P.913 -lPt999, 26.199 



A1QM 2011 . Cg GLN A 237 



49.639 -11.096 26.904 



ATOM 2012 CD GLN A 237 



49.907 -9.962 26.606 




ATOM 2016 CA PRO A 238 



55,275 -11.439. 25.246 



AXQtf 2P17 C . PRP A m 



55.194 -IP. 6*3 23.959 



ATCM 2018 O PRO A 238 



56,181 -10. 029 23.600 



30 



ATOM 2019 CP PRO A 239 



55,7?3 -12,97? 25 t Oil 



ATOM 2020 CG PRO A 238 



54^98 -13.710 25.886 



ATCM 2021 CD PRO A 238 



53.626 -12.999 26.Q69 



ATOM 2022 V MET A ?39 



$4,041 -io t 63S 23.296 



ATOM 2023 CA MET A 239 



53.924 -9. 807 22.104 



35 ATCM 2024 C MET A 239 



53.1P9 -9.509 22.362 



ATOM 2025 O MET A 239 
ATOM 2026. CB MET A 239 



52.792 -7.741 21.419 



53.460 -10,599 20.991 



ATOM 2027 CG MET A 239 



54.536 -11.534 20.261 



ATCM 2028 SD MET A 239 



53.994 -12.534 18.808 



40 



ATOM 2P29 CB MET A 239 



54.350 -11,357 17.41 22 



ATOM 2030 N LEU A 240 



52.947 -9.252 23.646 



ATCM 2P31 CA MB2 A 210 



52rl59 -7.P37 24.131 



A12M 2032 C. LEV A 240. 



52.774 -6,733 25*493 



ATOM 2033 O LEU A 240 



52.124 -6. 803 26.549 



45 



A12B 2034 CB LEV A 240 



50,645 -7.249 24.240 
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.00 16.91 
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CG 
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4?, 646 
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1.00 
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2036 


CD1 LEU A 240 
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25.033 


1.00 25.51 


.... G 
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54.076 


-6,461. 


25.456 
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1 . 00 
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ATOM 


2042 


-SB- 


SER A 241 
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-6.900 


26,495 


1.00 14.04 
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him 
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OG 
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1.00 
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... 
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ILE A 243 
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ATOM 


2053 


CGI 


ILE A 243 


54.014 
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32.735 


1.00 


17.37 


c 


ATGK 


2061 


CD1 


ILE A 24? 


54,725 


-3-?65 


32.714 


1,00 


14.82 


. . C 


A1SM 


2062 


N 


ASN A 244 


52.112 


1.217 


30.013 


1.00 16.43 


N 


ATOM 


2063 


PA 


A£N. A 244 


51.824 


2.699 


?0-0?8 


1.00 


19.99 


G 


ATOM 


2064 


c 


ASN A 244 


52,252 


?.292 


?1.349 


l.OQ 


19»63 


C 


AXPM 


2065 


0 


ASN A 244 


51.965 


2.727 


?2,405 


1,00 


1.9 » 59 


. G 


AX2W 


2066 


CB 


ASN A 244 


50.304 


2,997 


29.910 


1.00 


15.67 


-G 


ATOM 


2067 


OG 


ASN A 244 


49,769 


2.702 


29,517 


1.00 


14.57 


G 


ATOM 


2068 


OD1 


ASN A 244 


50,^46 


2,593 


27.580 


1,00 


13.64 


P 


ATOM 


206? 


NP2 


ASN A 244 


49.443 


2.491 


29r?9? 


1.00 10.16 


N 


ATOM 


2070 


N 


VAL A 245 


52.800 


4.499 


?1,326 


1-00 


13.50 




ATOM 


2071 


£A 


YAL A 245 


53.159 


5.134 


32,602 


.1-00 


1.3,49 


G 


ATOM 


2072 


0 


VAL A 245 


52.528 


6. .5 66 


32.644 


1,00 


16.25 


c 


ATOM 


2P73 


0 


VAL A 245 


52.786 


7,405. 


.31.770 


1.00 


_15i20 


o 


ATOM 


2074 


CB 


VAL A 245 


54.754 


5,16? 


32,910 


1.00 21.07 


, G 


ATOM 


2075 


001 


VAL A 245 


55.154 


6,095 


33,937 


1-00 


15.08 


C 


ATOM 


2076 


052 


YAL A 245 


55.280 


3,917 


33.143 


1.00 


15.82 


G 


ATOM 


2077 


N 


.GLY A 246 .. 


51.696 


6.943 


33,649 


1.00 14.03 


N 


ATOM.. 


2078 


CA 


GLY A 246 


51.027 


9-136 


33.707 


It 00 


16.87 


G 


ATOM 


2079 


c 


.GLY A 246 


50.146. 


9.203 


.34.239 


1.00 26.95 
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AXOl 2080 o SLY A 246 



50.323 7.401 35.950 



ATOM 2081 W 

aioi 2pe? ca 



A 247 



49.207 9.161 34.963 



A 247 



49.232 9,276 36,063 



AIQM 2083 C TOR A 247 



46.868 8.677 35.673 



ATOM 2084 O 



A 247 



46.069 8.306 36.508 



AT9M 
AIOL 



2085 CB THRA 247 



47.988 10.730 36.404 



2086 OG1 THRA 247 



47.409 11,399 35,265 



ATOM 2087 CG2 THR A 247 



49.275 11.378 36.724 1 



ATOM 2088 W GLY A 248 



46.583 8.651 34.384 



10 



Alffll 208? CA SLY A 248 



45,319 8.143 33.924 



ATOM 2090 C GLY A 248 



44,223 9.160 34,22$ 



ATOM 209 1 O GLY A 248 



43,059 8,86* 34,137 



ATOM 2092 N VAL A 249 



44,615 10.396 34.521 



ATOM 2093 CA VAX A 249 



43.673 11.464 34.827 



15 AlflM 2094 C VAL A 249 



43.747 12.596 33.786 



ATOM 2095 O VAL A 249 



44.853 13.006 33.387 



ATOM 2096 CP VAL A 249 



44.020 12.085 36.214 



ATOM 2097 CGl VAL A 249 



43.225 13.324 36.470 



AlflM 2222 CC2 VAL A 249 



43.782 11.083 37.306 



20 



ATOM 2099 N ASP A 250 



42.581 13.125 33,397 



ATOM 2100 CA ASP A 250 



42.489 14.232 32.439. 



ATOM 2101 C ASP A 250 



42.611 15.581 33.155 



AXflM 2102 P ASP A 250 



42.198 15,783 34.308 




AXCM Z1M PP2 ASP A 250 



39.767 12.501 31.153 



AICM 2107 N CYS A 251 



43.029 316,566 32,389 



ATOM 2108 CA CYS A 251 



42.962 17.906 32.851 



30 



ATOM 



2109 



CYS A 251 



42,918 19,779 31,577 



A3 



2110. 



CYS A 251 



43,699 19,560 30.633 



AXSM 2111 CB CYS A 251 



44,149 18 f 157 33-778 



AlflM 2112 gg CYS A 251 



45.129 19,619 33.453 



35 



AICH 2113 N 

ATOM 2114 CA 

ATOM 2115 C 

AIBi 2116 a 

him 2117 CB 



A 252 



41.932 19.673 31.494 



A 252 



41.834 20.588 30.335 



A 252 



42.999 21.592 30.236 



A 252 



43.657 21,926 31.249 



A 25? 



40.506 21.407 30.329 



ATOM 2118 OG1 T»R A 252 



40.460 22.304 31.447 



40 



AXPH 2119 C<?2 MB A 252 



39,309 20,495 30.372 



AX2M 2120 N ILE A 25? 



43.228 22.095 29.024 



him 2121 CA I LE A 253 



44,264 23,119 29.812 



XXm 2122 C I LB A 25? 



43.934 24.383 29.627 



ATCM 21Z2 Q ILE A 25? 



44.834 25.012 30.247 



•00 23.04 



.00 21.44 



.00 21.39 



.00 24.09 



.00 21.03 



QQ 22,24 



.0 0 . 1 9.62 



.00 19.99 



.00 24.95 



.00 22.61 



.00 21.42 



,00 25,70 



.00 30.72 



.00 26.09 



.00 32.70 



■00 26 .92 



00 39,59 



■00 36.11 



.00 41.30 



.00 27.95 



,00 20.64 



.00 27.63 



00 26.23 



■00 45.34 



.00 20.12 



.00 27.20 



,0P 26-47 



,00 19-45 



.00 34.86 



■00 29,47 



rQQ 14,85 



,00 21.21 



,00 2Pt5? 



.00 15.24 



,00 32.09 



■00 19.26 



,00 13.91 



.00 14.91 



.00 16,90 



,QQ 23.41 



,00 15.27 



45 



AlfiM 2124 CB ILE A 253 



44,404 23,452 27.302 1.00 24. Q5 



£ 
Si 
St 
Si 

SL 

Si 

_£ 
.£ 
SL 
JI 
-£ 
-£ 
-Q 
Si 

Si 

JQ 
H 
Si 
c 

Si 

_s 



Si 

Si 
0 

.£ 
_Q 
S. 
-H 
Si 

SI 

c 
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aiSM 212$ CJSl ILE A 253 44.862 22,200 26.561 l.oo 27. 33 £ 

AXCtf 2126 CS2 ILE A 253 1^123 24.479 27.077 i.qq 9.22 £ 

fl.TQM 2127 CDl ILE A 253 45.662 21.276 27.^ 52 1.00 49.56 £ 

ATOM 2126 N ARGA 254 42.637 24.709 29.707 1. 00 19. 56 H 

5 ATOM 2129 CA ARG A 254 42.22B 25.865 30.522 1.00 19.41 £ 

ATPM 2139 C ARg A 25* 42.712 25.713 31.970 1.00 IB. 10 £ 

ATOM 2131 O &BS A ZM 43.311 26.616 32.515 1.00 13.89 £ 

ATOM 2132 CB &B£ A 254 10,704 26,101 30.180 1.00 15.99 £ 

ATOM 2133 CG ARG A 254 40.282 27.378 31.255 1,00 9.96 £ 

10 ATOM 2134 CD ARG A 254 3B.809 27.702 31.218 1.00 24.79 £ 

AE2M 2135 NB Aftg A 251 38.498 28.414 29.997 1.00 29.42 N 

AIPH 2136. CZ .ARE A 25.1 38.693 29.723 29.794 1.00 59.8? £ 

AXPH 2137 NH1 Aftg A 25,1 39.194 30.527 30.732 1.00 42.5B H 

ATOM 2138 NH2 ARG A 254 38.377 30.245 28.620 1.00 18.44 

15 ATOM 213? N ASP A 255 42.406 24.564 32.586 1.00 20.22 N 

AIPH 2110 CA ASP A 25.5 42.795 24.205 33.974 1.00 16.48 £ 

ABQM 2111 g ASP A 255 44.321 24.372 34.069 1.00 22.43 £ 

ATOM 2142 O ASP A 255 44.868 24.897 35.060 1.00 18.53 Q 

AX2M 2143 CB ASP A 255 42.478 22.666 34.157 1.00 19.17 £ 

20 ATOM 2144 CG ASP A 255 42.144 22.246 35.61 0 1.00 47.08 £ 

ATOM 2145 OD1 ASP A 255 41.780 23.090 36.429 1.00 49.66 Q 

AXOH 2116 WZ.AS? A 255 12t020 21.016 35.990 1.00 1&JL2 2 

AIPH 2147 N LEU A 256 45.014 23.809 33.078 1.00 15.98 N 

Aias 2119 CA LEV h 256 46.465 23.844 33.06? l.QQ 21-76 £ 

25 axon 214? C LEV A 256 17.020 25.275 33.076 1.00 16.7? £ 

AIS2H 2150 O LEV A 256 47.825 25.697 33.946 1.00 15.24 Q 

ATOM 2151 CB LEU A 256 46.967 23.056 31.859 1.00 23.33 £ 

AT OM 2 152 Cg LEV A 256 10,1?1 23.100 31.765 1,00 26.80 £ 

ATOM 2153 CDl LEU A 256 49.171 22.334 32.984 1.00 17.13 £ 

30 AI34 2151 CP2 LEV A ?56 4? T 04Q ?2 t 724 30,34$ l.QQ 15,42 C 

ATOM 2155 N ALA A 257 46.520 26.048 32.140 1.00 13.77 H 

ATOM 2156 CA ALA A 257 46.938 27.436 32.025 1.00 12.70 £ 

ATOM 2157 C ALA A 257 46.656 28.237 33.267 1.00 10.73 £ 

ATOM 2158 O ALA A 257 47.451 29.073 33 .672 1.00 20.33 Q 

35 ATOM 215? Cg ALA A 257 16.209 29.073 30. 931 liQQ 13.31 £ 

atom 2160 N gLN A 259 , 15.170 29 ,0 9 0 33,935 1.00 12.40 . M 

ATOM 2161 CA GLN A 258 45.102 28.911 34.981 1.00 8.39 £ 

ATPM 2162., C gLN A 259 15 , 97? 29.190 36.166 1,00 13.19 £ 

ATOM 2163 O GLN A 258 4 6.178 29.281 37.029 1.00 22.96 £ 

40 ATOM 2164 CB GLN A 258 43.614 28.761 35.305 1.00 16.12 £ 

ATOM 2165 CG GLN A 258 42.674 29.0 96 34.130 1.00 30.19 £ 

ATOM 2166 CD GLN A 258 42.574 30.585 33.781 1. 00 37.29 £ 

&XP£ 2167 0E1 gLN A 2 s 9 42.311 31.171 31,610 1.00 21.21 . Q 

ATOM 2168 NE2 GLN A 258 42.021 30.876 32.572 1.00 15.94 N 

45 ATOM 2169 N THR A 259 46.179 27.182 36.232 1.00 16.21 N 
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ATOM 2170 CA 



A3 



A 25? 



46,982 26, 678 37.336 



ATOM 2171 £ THR A 259 



48.410 27.186 37.233 



ATOM . 2172 Q 



A 253. 



49.092 27.621 39,214 



2173 CB THR A 259 



47.066 25.192 37.361 



ATOM 2174 OG1 THR A 259 



45.752 24. 620 37.SQ9 



.00 16.95 



.90 20. 5f 



,00 21.44 



.00 27.56 



.00 20.92 



ATCM, 2175 cg2 THR A 259, 



47.936 24.796 39.545 



t90 12.95 



10 



ATOM 2176 N ILE A 260 



ATOM 



48.952 27. 170 36.028 



JIB 19.96 



ATOM 2177 CA ILE A 260 



50.292 27.704 35,939 



.00 23.01 



2178 



ILE A 260 



50.31? 29.190 36.225 



00 31.73 



ATOM 2179 O ILE A 260 



51.211 29. 627 36.993 



.00 25.90 



ATOM 2180 CB ILE A 260 



50.935 27.456 34.390 



.00 22.46 



ATOM 2181 CGI ILE A 260 



51.153 25, 940 34.232 



.00 24.12 



ATOM 2182 CG2 ILE A 260 



52.099 28.361 34.106 



.00 13.47 



15 



20 



AXflM 2183 CD1 ILE A 260 



ATOM 



51.501 25.443 32.810 



-00 12-58 



ATOM 2184 N ALA A 261 



49.280 29.910 35.764 



00 15.35 



AIQM 2185 CA ALA A 261 



49.177 31.355 36.048 



*0Q 16.00 



ATOM 2186 C ALA A 261 



49.316 31.604 37.550 



.00 20.58 



ATOM 2187 O ALA A 261 



50.104 32.443 37.987 



.00 16.09 



ATOM 2199 ALA A 261 



47.93 2 , 31 .959 35.497 



.00 13.65 



ATOM 2189 N LYS A 2 62 



48.551 30.843 38.323 



1.00 11-50 



AISM 2190 CA. LYS A 262 



49.579 30.905 39.770 



-00 10.13 



2191 



LYS A 262 



49.968 30.460 40.296 



■QQ 29.09 



ATOM 219 2 O LYS A 2 62 



50.503 31.084 41.205 



.00 29.37 



AXQM 2193 CB LYS A 2 62 



47.453 30.032 40.335 



00 12»5Q 



rnTUT— ~T gg-^kwaw2<62« 



.4,7«r332~2,9^9.62-^41...8 8 8. 



..00^16.51 



AIOl 2195 CD LYS A 262 



46.092 29. 092 42.371 



.00 46.61 



AXSW 2196 CE LYS A 262 



46.344 27.555 42.661 



iQO ?9t7Q 



ATOM 2197 K2 LYS A 262 

AISSH 2199 . .W VAL A 263 



45.157 26.703 43.200 



00 36.59 



50.589 29.443 39.705 



.00 17.44 



Jf 

s. 

S. 
Ji 
Si 
j: 

J2 
_£ 

-£ 
Ji 
X 
Si 



30 KHM 2199 CA VAL A 263 



51,915 29.039 40.171 



.QQ 19.72 



ATOM 2200 C VAL A 263 



52.997 30.170 39.997 



-00 32.12 



AIQM 2201 P VAL A 263 



53.971 30.412 40.934 



■ PO 21.19 



AXQH 2202 CB VAL A 263 



52.389 27.709 



JOS. 



.00 16.35 



ATOM 2203 CGI VAL A 263 



53.920 27 T 519 39,647 



.00 11>93 



35 ATOM 2204 CG2 VAL A 263 



51.646 26.522 40.093 



.00 14.99 



ATPM 2205 N VAL A 264 



52.913 30. 899 38.909 



00 21.75 



ATPM 2206 CA VAL A 264 



53.917 31.877 38.653 



.00 19.91 



AXCM 2207 C VAL A 264 



53.719 33.208 39.377 



•00 35.79 



40 



ATOM- 
ATOM 



2209 P VAL A 264 



54.632 ?4r032 39-492 



.QQ 29.99 



2209 CB VAL A 264 



54.05? 33 t QH 37 t 17? 



.00 24-27 



A1CM 2210 CGI VAL A 264 



54.728 33.269 36.822 



.00 33.59 



AESai 2211 CG2 VAL A 264 



54.840 30. 808 36.674 



-00 23.01 



A1SM 2212 W GLY A 2 65 



§2.550 33.379 39,969 



l.QQ 25.30 



Aiai 2213 CA GLY A 265 



52.241 34.620 40.636 



.00 24.14 



45 AZ2M 2214 C GLY A 265 



51.730 35. 694 39.632 



.00 35.03 
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ATOM 221S O GLY A 265 51 .773 36.911 39.96? 1.00 33.71 Q 

fiTOK 2216 N TYR A 266 51.794 35.257 38.428 1,00 26.25 N 

ATOM 2217 CA TYR A 266 50.698 36.151 37.373 l.QQ 26.55 £ 

him 2219 C TYR A 269 49.364 3S,745 37,819 It 00 31.91 £ 

5 ATOM 2219 O TYR A 266 48.532 36.067 38.456 1.00 27.99 Q 

ATOM 2220 CB TYR A 266 $Q,$Q] 35,463 ?^ t pQg 1 T 00 24-31 £ 

ATOM 2221 CG TYR A 266 49.994 36.381 34.884 1 .00 28.64 £ 

ATPM 222? CP1 TYR A 266 $0.670 ?7.592 34.542 1-00 35.05 £ 

ATOM 2223 CD2 TYR A 266 48.860 36.038 34.118 1.00 22.60 £ 

10 ATOM 2224 CE1 TYR A 266 50.212 38.434 33.472 1.00 20.73 £ 

&TPM 2225 CS2 TYR A 26$ 49.429 36.959 33,012 ItPP 20.93 £ 

him 2226 CZ TYR A 266 49.088 38.062 32.735 1.00 23.85 £ 

ATOM 2227 OH TYR A 266 48.622 38.851 31.710 1.00 33.40 Q 

him 2229 H LYS A 267 49.217 38,043 37.604 1.00 25.72 H 

15 ATOM 2229 CA LYS A 267 47.988 38.697 38.0 09 1.00 30.77 _£ 

him 2230 C LYS A 267 47.217 39.280 36.798 1.00 28. BS £ 

him 2231 Q LYS A 267 46.179 39.894 36.949 1.00 31.17 £ 

him 2232 CB LYS A 267 48.279 39.741 39.092 1.0 0 27.13 C 

him 2233 ..c? LYS A 267 49.729 39.129 40.403 1.00 23.18 £ 

20 him 2234 CD LYS A 267 4B.420 40.096 41.562 1.00 30.98 £ 

jyTOM 2235 CE LYS A 267 47.933 39.358 42.820 1.00 48.52 £ 

ATOM 2236 NZ LYS A 267 47.00^ 38.208 42.505 1.00100.00 

AIQH 2237 N GLY A 268 47.716 39.054 35.594 1.00 22.67 H 

ATOM 2238 CA GLY A 268 47.019 39.518 34.394 1.O0 21.38 £ 

25 ATOM 2239 C GLY A 268 45.856 38.568 34.085 1.00 31.03 £ 

AT OM . 2240 O GLY A 2 68 45.455 37.728 34,911 liPP 19,7] Q 

him 2241 ,N ARG a 269 45,397 39, $4? 32.949 IrQO 30. u 

him 2242 CA ARg A 269 44.293 37.946 32.399 1,00 26.47 £ 

ATOM 2243 C ARG A 269 44.680 36.705 31.489 1.00 22.35 £ 

30 him 2244 O ARGA 269 45.378 36.926 30.524 1.00 22.75 £ 

ATOM 2245 CB ARG A 269 43.297 38.753 31.626 1.00 22.65 B £ 

him 2246 eg ARG A 269 42.201 39,390 32,463 1,00 24.21 , £ 

him 2247 CD ARG A 269 40.936 39.465 31.568 1.00 83.45 £ 

ATOM 2248 ARG A 269 40.113 40.676 31.762 1.00100.00 H 

35 ATOM 2249 CZ ARG A 2 69 38.808 40.751 31.431 1.00100.00 £ 

ATOM 2250 NH1 ARG A 2 69 38.201 39.691 30.921 1.00 99.93 N 

ATOM 2251 MH2 ARG A 269 38.094 41.865 31.663 1.00100.00 ti 

ATOM 2252 N VAL A 270 44.195 35.494 31 .75 9 1 ,00 19 ,97 M 

ATOM 2253 CA VAL A 270 44.468 34.3B9 30.856 1.00 24.82 £ 

40 him 2254 C VAL A 270 43.319 34.456 29.824 1.00 22.51 £ 

him 2255 O VAL A 270 42.145 34.501 30.181 1.00 25:79 Q 

ATOM 2256 CB VAL A 270 44.436 32.979 31.57 1 1.00 24.03 £ 

ATOM 2257 CGI VAL A 270 4 4.576 31.861 30.533 1.00 20.72 £ 

ATOM 2258 CG2 VAL A 270 45.506 32.849 32.639 1.00 11.27 £ 

45 ATOM 2259 N VAL A 271 43.660 34.409 28.554 1.00 25.18 H 
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10 



ATOM 


22 60 


CA 


VAL A 271 


42 . 666 


34.492 


27.487 


1.00 


28.32 


c 


ATOM 


2261 


C 


YAL A 271 


42 , 819 


33.370 


26.442 


1.00 24.89 


c 


ATOM 


2262 


9 


VAL A 271 


43 . 923 


33.115 


25.980 


1,00 


21 « 99 


0 


ATOM 


2263 


CB 


VAL A 271 


42.901 


35.813 


26.736 


1.00 


29,25 


c 


ATOM 


2264 


CGI VAL A 271 


42 . 256 


35.773 


25 .370 


1.00 


31,91 


c 


ATOM 


2265 


CG2 VAL A 271 


42 . 421 


36.989 


27.565 


1,90. 


19.72 


c 


ATOM 


2266 


N 


PHE A 272 


41.716 


32 .758 


26.019 


1.00 


26.14 




ATOM 


2267 


.CA, 


PHE A 272 


41 752 


31.747 


24.963 


1.00 


24.34 


c 




2266 


9 


PHE A 272 


41.236 


32 .266 


23 . 623 


1.00 


29.95 


c 


ATOM 


2269 


9 


PHE A 272 


40 . 155 


32 .826 


23.582 


1.00 


22.01 


0 


ATCM 


^270 


CB- 


PHE A 272 


40.960 


30.506 


25.391 


1.00 


20.97 


c 


ATOM 


2271 


CG 


PHE A 272 


41 . 764 


29.570 


26.243 


1,00 


21.77 


c 


ATOM 


2272 


CPl 


PHE A 272, 


41 940 


29. 842 


27. 610 


1.00 


14,60 


c 


ATOM 


2273 


CD2 


PHE A 272 


42 .504 


28 .550 


25.656 


1.00 


22,1? 


c 


ATOM 


2274 


CE1 


PHE A 272 


42 763 


29.041 


28, 434 


1*00 


17.89 


r 


ATOM 


727S 


CE2 


PHE A 272 


41 llfi 


27 726 


2 6 454 


1.00 


27,64 


r 


ATOM 


277fi 


9Z 


PHE A 272 


41 47R 


27 97Q 


77 851 


l.op 


25,14 


r 


ATOM 


2 277 


N 


ASP A 273 


47 019 


17 114 


77 542 


1.00 


29,45 


H 
It 


ATOM 






-ASJP A 27? 


41 "=> m 

lit?? ' 




21 714 


1.00 


22,33 




ATOM 


2279 


C 


ASP A 273 


40.896 


31.365 


20.493 


1.00 


25.67 


c 


ATOM 


2280 


o 


ASP A 273 


41.53? 


30.570 


19,7*3 


1.00 


17.91 


9 


ATCM 




CB 


ASP A 27? 


42.672 


33.114 


.20.343 


1.00 


21*45 


C 


ATOM 


2292 


CG 


ASP A 273 


42.131 


. 33.626 


18.990 


1.00 


26.89 


C 


ATOM 


2293 


GDI A£P A 273 


40.975 


33.249 


19,599 


1.00 


27,76 


, 9 



15 



20 



.25. 



ATOM 


2285 


N 


ALA A 274 


39.589 


31.284 


20.649 


IrOO 


15t59 


N 


ATCM . 


2296 


CA 


ALA A 274 


39,932 


30.129 


20.129 


1.00 


23r75 


c 


ATCM 


2287 


C 


ALA A 274 


39.953 


30.169. 


19.653 


1,00 


32.30 


c 


ATOM 


2288 


9 


ALA A 274 


38.284 


29.256 


18,929 


1,00 


29.37 


9 


ATCM 


2289 


CB 


ALA A 274 


37.567 


29.905 


20,777 


1,00 


18.87 


c 


AT9M 


2290 


N 


5EF> A 275 


39.372 


31.243 


19.041 


1-00 


21,10 


N 


ATOM 


2291 


CA 


SER A 275 


39.343 


31.28? 


16.631 


1.00 


26.90 


9 


ATOM 


2292 


c 


SER A 275 


40.390 


30.300 


16.116 


1,00 


43,37 


c 


ATOM 


2293 




SEP A 275 


40,421 


29,949 


14,927 


1.00 


46.32 


,9 


ATCM 


2294 


CB 


SER A 275 


39.547 


32.683 


16.074 


1.00 


15.19 


C 


ATCM 


2295 


OS 


SW A 275 


40.904 


33.070 


16,07? 


1.00 


28,71 


0 


ATCM, 


2296 




LV? A 276 


41,192 


29t7?0 


17 t 037 


1.00 


22-99 


H 


ATCM 


2297 


CA 


A 276 


42.178 


29.791 


16.639 


1,00 


23,29 


_c 


ATCM 


2298 


c 


LYS A 276 


41.645 


27-405 


16.976 


l.op 


29,73 


c 


ATOM 


2299 


9 


LYS A 276 


40.992 


27,206 


18.010 


ItOO 


25.10 


0 


ATOM 


2300 


CB 


LYS A 276 


43.544 


. 29.051 


17.275 


1.00 


19.19 


C 


ATOM 


2,301 


CG 


,LY5 A 276 


43.957 


30.496 


17.219 


1,00 


32.11 


c 




2302 


CD 


LYS A 276 


44,062 


, 30,952 


15.798 


1,00 


22-43 


C 


ATCM 


2303 


CE LYS A 276 


44.930 


32.067 


15.570 


1,00 


23,19 


c 


ATCM 


2304 


NZ 


LYS A 276 


45.454 


32,117 


14.152 


IrOO 


29.42 


N 



30 



35 



40 



45 
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ATOM 2305 N PRO A 277 41.89? 26.476 Ifi.nSS 1.00 36.04 8 

ATOM 2306 CA PRO A 277 41.446 25.087 16.170 1.00 35.93 £ 

ATOM 2307 C PRO A 277 42.022 24.332 17.363 1.00 29.30 £ 

ATOM 2308 Q PRO A 277 43.103 24.650 17. MS 1.00 30.54 £ 

5 ATOM 2309 CB PRO A 277 41.975 24.453 14.B7B 1.00 39.65 £ 

ATOM 2310 CG PRO A 277 13,24? 25,291 14.509 1.9.9.42.99 £ 

ATOM 2311 CD PRO A 277 42.787 26.670 14.892 1.00 37.84 £ 

atom 2312 N A?p a 279 41,273 £3,339 .17.fi.Q9 1.9.9 22..3S H 

KICK 231? £A ASP & 219 41,745 \9,?P3 1,99 22. IS £ 

10 ATOM 2314 C ASP A 278 42.184 21.189 18. 272 1.00 19.66 £ 

ATQM 231$ P ASP A 278 41, 99? 29.917 17,117 1.99 23,49 £ 

ATOM 2316 CB ASP A 278 4 0.636 22.241 19.971 1.00 15,0? £ 

ATOM 2317 CG ASP A 278 40.216 23.503 20.702 1.00 22. B6 £ 

ATJ23 2318 pJa ASP A 278 41.113 24.254 21.096 1.00 25. IB _fi 

15 ATOM 2319 OD2 ASP A 278 38.999 23.787 20.812 1.00 39.55 £ 

AXOf 2320 -N 6MT A 27.9 42.846 20.355 19.044 1.00 30.65 H 

him 2321 ca aria 279 43.229 19.034 ie.546 1.00 33.7s £ 

AI&1 2322- C ELY A 2?9 42.115 18.099 18.944 1.00 38.10 £ 

hjm 2323 -3 GLY A 279 40.963 18.517 19.068 1.00 47.52 £ 

20 ATCM . 232* A THR A 299 42 L U9 15.939 19.177 1.99 29,44 U 

ATOM 2325 CA THR A 280 41.328 15.990 19.587 1.00 26.68 £ 

ATOM 23 26 ,C THR A 289 49 .9 99 1 9 ,439 29 .97 2 . 1 , 99 23. $2 £ 

AX2M 2327 Q THR A 299 41,979 17,997 21,713 1,99 23.92 2 

AlOfl 2329 Cfi THR ft 299 41.695 14,4 92 19.540 1.00 40.78 _£ 

25 ATOM 2329 Wl THR A 289 42,999 H.272 29. 29$ 1,99 25. 55 _£ 

ATOM 2330 CG2 THR A 280 41.B93 14.054 18.095 1.00 37.71 £ 

AIPH 2331 N PRO A 291 3?, $72 19-993 21.34$ 1, 09 25.54 M 

ATOM 2332 CA EES A 29 1 , 39,1 2 9 1 5,454 22.529 1-99 25.72 £ 

ATOM 2333 C PRO A 281 39.776 15.778 23.800 1.00 26.02 £ 

30 ATOM 2334 P PRP A 211 39:752,19,314 24.915 l,99,22..5g £ 

A1PH 2335 CB PRO A 281 37.650 15.990 22.559 1.00 28.89 £ 

ATOM 2336 CG PRO A 281 37.417 15.540 21.201 1.00 29.39 £ 

ATO M . 2 33 7 . CP P ftP A 291 39.161 15 - 13 9 2 9,949 1,99 26.82 £ 

AX2M 2338 N ARG A 282 40.281 14.567 23.587 1.00 27.88 , H 

35 ATPM 2339. ARC A 292 19-806,. 13.91.7 , 24.729 1.90 34,99 _£ 

atom 23 40. p ARG A 2 92 41- 977 . 12 -919 24,384 1,99 ,27.92 £ 

ATOM m P ARP A 292 41,913 12,192 23.435,1,99 23.83 £ 

ATPM 2342 CP APS A 2 92 39 ,676. 13,917 25.495 1,99 29,99 £ 

ATOM 2343 CG ARG A 282 40.035 12.467 26.775 1.00 22.81 £ 

40 ATOM 2344 CD ARG A 282 3B , 762 11.925 27.442 1.00 26.77 £ 

ATOM 2345 ME ABfi A 292 39,993 11,345 29.791 1.99 39,49 H 

ATOM 2346 CZ ARG A 282 38.518 10.139 29.164 1.00 37.74 £ 

ATOM 2347 NH1 ARG A 282 37.813 9.360 28.346 1.00 28.45 B 

ATOM 2348 NH2 ARG A 282 3B.754 9.700 30.384 1.00 27.25 H 

45 ATOM 2349 N LYS A 283 43.016 12.963 25.223 1.00 28.91 ; H 
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AXPtf 2350 CA LYS A 283 

him 2351 C LYS A 283 

him 2252 Q LYS A 263 



44.217 12.171 25.051 



44.796 31.766 26.404 



45.262 12.626 27.138 



him 2353 CB LYS A 283 



45.22$ 12Jfll 24.287 



him 2354 CG LYS A 283 



46.111 12.251 23.316 



him 2255 CD LYS A 283 



46.526 13.171 22.143 



him 2356 CE LYS A 283 



45.719 12, 937 20.83$ 



him 2357 NZ LYS A 283 



46.41? 13.332 19.535 



him 2358 N LEV A 284 



44.747 10,467 26.734 



10 



him 235? CA LEV A ,284 



45.327 9.905 27.997 



him 236Q c LEV A 284 



45.463 8.386 28.047 



him 2361 P LEU A 284 



44.67? 7.655 27.446 



ATOM 2362 CB LEU A 284 



44.641 10.387 29.284. 



him 2363 gfi LEU A 284 



43.334 9. 7QQ 29.714 



15 



him 2214 CDl LEU A 284 



42.881 10.089 31.152 



ATOM, 2365 CD2 leu a 284 



42.203 9.953 28.693 



ATOM 2366 N LEU A 285 



46.45? 7.939 28.820 



him 22£Z CA LEV A 285 



46.792 6. 527 29.003 



him 225fl £ LEU A 285 



45.880 5.865 30.006 



20 



him 2369 P LEV A 285 



45.576 6,439 31r058 



ATOM 2370 CB LEU A 285 



48.229 6.389 29.585 



ATOM 2371 Cg LEV A..285 



49.307 6.970 28.672 



ATPH 2372 CPl LEV A 285 



50.703 6.7Q$ 29.122 



atom 2373 cpz LEV A 285 



49.051 6.368 27.330 




ATOM 2376 C ASP A 286 



46.128 3.055 31.498 



him 2377 P ASP A 286 



46.991 2.372 30.938 



ATOM 2378 Cg ftgp A 286 



44.073 2.702 29.970 



30 



ATOM 2379 CG ASP A 286 



43,409 1 .699 30 .94? 



ATOM 2380 ODl ASP A 286 



43.932 1,437 32.093 



ATOM 2381 PP2 ASP A 286 



42.316 .1.231 30 .5 8 . 3 



ATOM 2382 N VAL A 287 



46,230 3,317 32.79; 



ATOM 2383 CA VAL A 287 



47.354 2.816 33.556 



35 



ATPM 2384 C VAL A 287 



46 .973 1 .695 34 .521 



him 2385 P VAL A 287 



47.613 1.473 35.572 



ATOM 2386 .CP VAL A 287 



4 8 .10; 4. 006 34,260 



ATO M 2397 c gq V AL A 297 



48.534 5.Q95 33.224 



ATOM 2388 CG2 VAL A 287 



47.173 4. 670 35.258 



40 



him 238? n rm a 28b 



45. 904 0.99 2 34 .152 



him 2390 CA 



A 288 



45.428 -0.152 34.956 



ATOM 2391 C THR A 288 



46.561 -1. 177 35.227 



him 239 2 , P THR A 288 



46.778 -1.586 36.365 



ATOM 2393 CB THR A 288 



44.?8B -0.909 34.244 



45 ATOM 2394 OGl THR A 288 



43.120 -0.0.96 34.106 



.00 24.32 



.OP 29.57 



.00 33.16 



.00 21.93 



.00 32,38 



.00 95.77 



00100.00 



.00100.00 



00 23.37 



.00 16.08 



.00 20.46 



i00 25.45 



00 16.30 



.00 25.97 



00 22. U 



■00 23.92 



00 18.51 



.00 16-77 



QQ 30.75 



.00 22.02 



00 15.85 



.00 21.51 



.00 15.15 



.00 16.94 



.00 20.54 



.00 23.38 



00 14.65 



-00 2 4.6Q 



QQ 24.60 



.00 26.03 



00 15.44 



.00 15.58 



.00 16.48 



.00 16.63 



.00 29.84 



>00 18-39 



•00 37.79 



.00 22.27 



.00 19.34 



00 27,47 



00 24.87 



.00 22.86 



.00 24.84 



-2 

M 

Si 
Si 
-2 

Si 



Si 
Si 
.Q 
_P_ 
H 
Si 
Si 
-2 
Si 
Si 
£ 

_C_ 
Si 
Si 

Si 
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AIOl 2395 CS2 THE A 298 43,91f -2,113 ^.fl?4 1.00 2?, 99 £ 

ATOM 2396 N ARG A 289 17.290 ~1»585 34.179 1.00 2gr06 K 

ATOM 2397 CA ARG A 269 48.428 -2.506 34.319 1.00 16.92 £ 

ATOM 2398 C MS A 289 49.405 '2.037 35,408 1.00 22,9$ £ 

5 ATOM 2399 O ARG A 289 4 9.B47 -2.790 36.275 1.00 23.03 , Q 

ATOM 2400 CB ARg A 289 ! 49.208 -2.607 32,97$. 1.00 12,43 £ 

ATOM 2401 CG ARG A 289 48.934 -3.804 32.103 1.00 29.39 £ 

ATOM 2402 CD ARG A 2fl9 50.016 -4.102 31.037 1.00 25. 66 £ 

ATOM 2403 ME ARg A 289 49.441 -4,99$ 30,020 1.00 17,2$ B 

10 ATOM 2404 CZ ARG A 289 50.053 -5.459 28.930 1.00 38.82 £ 

ATOM 2405 NH1 ARG A 289 51. 30$ -5.153 28,$$P 1,QQ 13,51 E 

ATOM 2406 NH2 ARG A 289 49.400 -6.262 28.096 1.00 37.68 2£ 

ATOM 2407 F LEV A 290 4 9.815 ..-0.78$. 35..3PS 1,00 2$.$0 U 

ATOM 2408 CA LEU A 290 50.809 -0.254 36.219 1.00 25.42 C 

15 ATOM 2409 C LEU A 2?0 50.324 -0.376 3 7.656 1.00 24.17 £ 

AX2M 241Q ,.P LEV A 290 51.072 -0.759 38.574 1.00 19.94 £ 

AlflM 2411 CB LEV A 290 51,000 1.219 35.87$ 1.00 24.$$ £ 

A TOM 2412 CG LEU A 290 52.281 2.019 36.066 1.00 24.67 £ 

AI9U 2413 CD! LEU A 290 51.992 3.479 36.504 1.00 29.25 £ 

20 AlflM 2414 CD 2 LEU A 290 53.450 1.335 36. 788 1.00 15.82 £ 

ATPM 2415 W HIS A 291 49.093 0.075 37.868 1.00 30.10 H 

ATOM 241$ OA His A 291 48.513 0.074 39.212 1.00 34,17 £ 

AXSM 2417 C HIS A 291 48.411 -1.367 39.730 1.00 43.41 £ 

ATM 2418 P HIS A 291 48.621 -1.654 40.929 1.00 38.81 Q 

25 ATOM 2419 CB HIS A 2 91 47.113 Q,g74 39.143 1.00 .28,01 £ 

ATOM 2420 CG HISA2gl 47.097 2.153x 38.984 1.0O 29.68 £ 

ATOM 2421 NP1 HIS A 291 48.242 2.921 39.015 1,QQ ?5.$3 H 

ATOM 2422 CD2 HIS A 291 46.068 3.024 38.855 1.00 31.18 ; £ 

ATOM 2423 CE1 HIS A 291 47.926 4 .197 36.845 1.00 24.20 £ 

30 ATOM 2424 NE2 HIS A 291 46.612 4.289 38.747 1.00 21.92 H 

ATOM 2425 N GLN A 292 48.048 -2.260 38.821 1.00 30.73 H 

Aiai 24Z6 ca SLN A 292 47,950 -3,S54 39.181 1-00 34.82 £ 

ATOM 2427. £ SLH A 292 49.287 -4.197 39, $22 1-QQ 3$,?3 £ 

ATffH 2428 O GLN A 292 49.323 -5.040 40.510 1.00 27.56 ; Q 

35 ATOM 2429 CB PLN A 29? 47.322 -4,487 38, 0$9 1,00 28.23 £ 

ATOM 2430 CG GLN A 2 92 4 5.798 -4.405 38.171 1.00 B1.15 £ 

ATOM 2431 CD GLN A 292 45 .023 -4.954 36.963 1.00100.00 , £ 

ATOM 2432 OE1 GLN A 292 45.597 -5.410 35.951 1.00 99.65 Q 

ATOM 2433 NE2 GLN A 292 4 3.687 -4.895 37.073 1.00 40.86 12 

40 ATOM 2434 N LEV A 29.3 50.375 -3 t $58 39.058 1.00 31,75 : H 

ATOM 2435 CA LEU A 293 5 1.750 -4.072 39.383 1.00 22.67 £ 

ATOM 2436 C LEU A 293 52.238 -3 .323 40.613 1.00 28.64 C 

ATOM 2437 O LEU A 293 53.420 -3.377 41.017 1.00 22.27 Q 

ATOM 2438 CB LEU A 29? 5 2.665 -3.769 38.205 1.00 25.57 , £ 

45 ATOM 2439 CG LEU A 2 93 52.497 -4.703 37.016 1.00 35.11 ; £ 
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AXflM 2440 CD1 LEU A 293 53.306 -4.170 35.836 1.00 28.25 £ 

ATOM 2441 CD 2 LEU A 293 52.965 -6.110 37.439 3.00 47.81 £ 

AX2M 2442 N GLY A 294 51.316 -2.510 41.111 1.00 33.08 H 

ATOM 2443 CA GLY A 294 51.4B8 -1.793 42.347 1.00 24.90 C 

5 ATOM 2444 C GLY A 294 52.272 -0.512 42.326 00 29.31 £ 

ATCM 2445 O GLY A 294 53.070 -0.249 43.223 1.00 25.25 O 

ATOM 2446 W TRP A 295 52.000 0.347 41.368 1.00 27.83 g 

ATOM 2447 CA TRP A 295 52.687 1.623 41.385 1.00 19.45 £ 

ATOM 2148 C TRP A 2?? 51,564 2.731 41,081 l.QQ 25,79 £ 

10 ATOM 2449 O TRP A 295 50.765 2.527 40.297 1.00 20.43 £ 

ATOM 2450 CB TRP A 295 53.961 1.614 40.524 1.00 12.85 £ 

ATOM 2451 CG TRP A 295 54.750 2.911 40.618 1.00 23.04 £ 

ATOM 2452 CPl TRP A 295 55.897 3.161 41.368 1.00 23. £g £ 

ATOM 2453 CD2 TRP A 295 54.415 4.159 39.979 1.00 20.72 £ 

15 ATOM 2454 WEI TRP A 295 56.258 4.493 41.244 1.00 18.67 H 

ATOM 2455 CE2 TRP A 295 55.389 5.113 40.373 1.00 20.95 £ 

AXQH 2456 CE3 TRP A 295 53.406 4.550 39.102 1.00 21.47 £ 

AXSM 2457 CZ2 TRP A 295 55.338 6.439 39.958 1.00 17.58 £ 

AXSM 2458 CZ3 TRP A 295 53.403 5.873 38.632 1.00 21.57 £ 

20 ATOM 2459 CH2 TRP A 295 54.368 6.787 39.058 1.00 19.45 £ 

AXSH 2460 W TYR A 296 51.709 3.797 41.884 1.00 25.17 H 

AXgtf CA TYR A 296 50.720 4.883 41.731 1.00 24.90 C 

AISM 2462 C TYR A 296 51.517 6.178 41.857 1.00 30.85 _£ 

AXQM 2463 Q TYR A 296 52.363 6.272 42.745 1.00 21.27 £ 

^j^^AI^ 

ATCM 2465 CG TYR A 296 48.685 3.651 42.744 1.00 23.04 £ 

ATOM 2466 CD1 TYR A 296 49.078 2.343 43.088 1.00 31.62 £ 

AlfiM 2467 CP2 TYR A 296 47r39P 3.853 42.289 l.oo 26.02 £ 

AXStt 2468 CE1 TYR A 296 48.203 1.268 42.935 1.00 24.42 £ 

30 ATOM 2469 CE2 TYR A 296 46.493 2.770 42.127 1.00 24.81 £ 

A12M 2470 CZ TYR A 296 46.902 1.483 42.464 1.00 39.41 £ 

A1SM 2471 OH TYR A 296 45.984 0.434 42.337 1 .00 66.19 £ 

ATOM 2472 W HIS A 297 51.324 7.123 40.924 1.00 20.95 H 

ATCM 2473 CA HIS A 297 52.130 8.343 40.938 1.00 26.86 £ 

35 AI£M 2474 C HIS A 297 51.947 9.175 42.210 1.00 35.01 £ 

ATOM 2475 O HIS A 297 50.885 9.132 42.874 1.00 26.92 £ 

A1CM 2476 CB HIS A 297 51.819 9.192 39.733 1.00 25.77 £ 

ATOM 2477 CG HIS A 297 50.489 9.842 39.803 1.00 31.16 £ 

him 2478 NDl HIS A 297 49.314 9.145 39.633 1.00 34.21 H 

40 aiOJ 2479 CD2 HIS A 297 SO. 135 11.094 40.167 1.00 25.83 £ 

A1SM 2480 CE1 HIS A 297 48.290 9.972 39.776 1.00 24.14 £ 

AlfiM 2491 WE2 HIE A 297 48.7<?; 11,164 4P.P97 l.PQ 23.35 K 

A1QM 2482 N GLU A 298 52.983 9.926 42.554 1.00 24.98 H 

ATOM 2483 CA GLU A 298 52.957 10.683 43.798 1.00 27.65 £ 

45 AIOl 2494 C GLU A 298 52.831 12.187 43.741 1.00 36.86 £ 
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?iTCM 2485 o glu A 298 5?!, 433 12,792 44,718 1.90 43.61 Q. 

frTQM 2486 CB GLU A 29B 54.153 1Q.319 44.686 1.00 22.02 „ £ 

ATOM 2487 CG GLU A 298 54.004 8.943 45.285 1.00 36.42 £ 

ATOM 2488 CD GLU A 298 54.999 8.664 46.406 1.00100.00 £ 

5 ATOM 2489 OE1 GLU A 298 56.223 8.561 46.152 1.00 44.79 O 

flTQM 2490 OE2 GLU A 298 54.526 8.470 47.S47 1.00100.00 £ 

atom 2491 n ile a 299 $3.232 12.800 42. 63? 1.99 23.49 H 

ATOM 2492 CA ILE A 299 53.268 14.244 42.562 1.00 13.25 £ 

ftTOM 2493 C ILE A 299 52.016 14.848 41.906 1.00 27.05 £ 

10 ATOM 2494 O ILE A 299 51.681 14.530 40.757 1.00 26.73 _Q 

ATOM 2495 CB ILE A 299 54.586 14.711 41.862 1,00 15.93 £ 

ATOM 2496. cgi ILE A 299 . 55.936, 14.1*3 42.696.. 1.99 23.93 £ 

ATOM 2497 CG2 ILE A 299 54.596 16.213 41.541 1.00 17.37 £ 

ATOM 2498 CDl ILE A 299 57.232 14.221 41.7B7 1.00 21.32 £ 

15 axes ?499 M SER A 300 ; 51.323 15.716 42,6*9 1.9P 19.55 H 

ATOM 2500 CA SER A 300 50.177 16.449 42.091 1.00 19.58 £ 

ijm 2591 c SER A 399 59.714 17.415 41.942 1.99 17.29 £ 

ATOM 2502 0 SER A 300 5 1.824 17.941 41.178 1.00 21.06 £ 

ATOM 2503 CB SER A 300 49.542 17.307 43.181 1.00 16.78 £ 

20 ATOM 2594 W SER A 399 59.548 17.969 43.923 1,99 75. 9P Q 

frTOM 2505 N LEU A 301 49.870 17.755 40.075 1.00 16.13 fi 

AIOS 2506 CA LEU A 301 50.246 18.675 39.014 1.00 17.70 £ 

ATOM 2507 C LEU A 301 50.689 19.964 39.646 1.00 20.11 C 

ATOM 2508 O LEU A 301 51.714 20.568 39.303 1.00 20.46 O 

25 ATOM 2509 CB LEU A 301 48.990 18.981 38.197 1.00 17.92 £ 

ATOM 2510 CG LEU A 301 49.182 20.030 37.112 1.00 25.15 £ 

ATOM 2511 CDl LEU A 301 50.233 19.552 36.996 1.99 1,9.92 £ 

AT9M 2512 CP2 LEV A 391 47.954 20.177 36.436 1.99 25.99 £ 

ATOM 2513 N GLU A 302 49.845 20.398 40.554 1.00 27.01 & 

30 ATOM 2514 CA GLU A 302 50.053 21.636 41.2B0 1.00 37.72 C 

ATOM 2515 C GLU A 302 5 1.410 21.618 41.996 1.00 29.99 £ 

ATOM 2516 O GLU A 302 52.245 22.514 41.798 1.00 27.15 £ 

ATOM 2517 CB GLU A 302 48.899 21.841 42.275 1.00 43.10 £ 

ATOM 2518 CG GLU A 302 49.061 23.061 43.174 1.00 90.85 £ 

35 ATOM 2519 CD GLU A 302 48.451 24.324 42.599 1.99199.99 £ 

ATOM 2520 OE1 GLU A 302 47.566 24.209 41.706 1. 00100.00 Q 

ATOM 2521 OE2 GLU A 302 48.808 25.432 43.036 1.00 64.50 £ 

atom 2522 w ALA A 303 51.646 20.591 42.901 , 1.00 9.72 H 

ai25 2523 CA ALA A 303 52.937 20.455 43.459 1.00 15.03 £ 

40 ATOM 2524 C ALA A 303 54.102 20.355 42.450 1.00 19.85 £ 

ATOM 2525 O ALA A 303 55.104 21.090 42 . 553 1 . 00 22 ■ 24 Q 

ATOM 2526 CB ALA A 303 52.938 19.258 44.419 lt99 19. 9? £ 

ATOM 2527 N GLY A 304 53.953 19.4 72 41.467 1.00 13.05 H 

SUM 2528 CA GLY A 304 54.970 19.321 40.448 1.00 8.94 £ 

45 ATOM 2529 C, GLY A 304 55.239 20.621 39.695 1.00 20.31 £ 
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him 2530 „ P GLY A 304 



56.394 20.900 39.322 



ATOM. 2531 N LEU A 305 



54,191 71,383 39,361 



ATOM 2532 CA LEU A 305 



54.483 22.622 38.611 



ATOM 2533 C LEU A 305 



55.281 23. 669 39.456 



AI0H 2534 P LEV A 305 



56,194 24,395 39,971 



ATOM 2535 CB LEU A 305 



53.202 23.245 38,033 



ATOM 2536 CG LEU A 305 

ATOM 2537 CP1 EEV A 305 



52.357 22.647 36.890 



50.975 23,394 36.799 



ATOM 2538 CD2 LEU A 305 



53.079 22.724 35.543 



10 



AXGM 2539 W ALA A 306 



54.904 23.757 40.724 



AI0H 2540 CA ALA A 306 



55.544 24.660 41.655 



ATOM 2541 C ALA A 306 



57.035 24.380 41.743 



2542 Q ala a 306 



57.952 25.290 41.662 



ATOM 2543 CB ALA A 306 



54.937 24.471 43.002 



15 



ATOM 2544 SEE A 3 07 



57.378 23.137 42.011 



AlfiM 2545 CA SER A 307, 

ATOM 2546 C SBR A 307 



2547 Q SEP A 3Q7 



58.793 22.756 42.162 



59.547 22.885 40.832 



20 



A3 
A3 
A3 



60.742 23,212 40.786 



2548 CB SER A 307 

2549 OS SER A 307 



58.951 21,304 42,622 



58.517 20.454 41.526 



ATOM 2550 N 



A 308 



58.849 22.631 39.735 



ATOM 2551 CA TKR A 308 



59.456 22.738 38.413 



AXP24 2552 C THR A 309 



59.757 24.216 38.107 



A3 



2553 P THR A 308 



60.919 24,546* 37,591 



,00 J18.72 




AXSM 2556 CS2 



A 309 



59,094 22.330 35,923 



AI2M 2557 N TYR A 309 



58.846 25.119 38.453 



AICM 2559 CA TYR A 309 



59.110 26.549 38.241 



30 



ATOM 2559 C TYR A 309 



60.383 27.0 59 39.045 



ATOM 2560 Q TYR A 309 



61.179 27-858 38.577 



A3 



2132 CB TY R A 30 9 



57.919 27.373 36,533 



ATOM 2562 CG TYR A 309 



57.944 28.895 ::.392 



ATOM 2563 CP1 TYR A 3Q9 



58.397 29,457 37,224 



35 



ATOM 256 4 CD2 TYR A 309 



57.575 29.757 39.442 



AX2H 2565 CE1 TYR A 3Q9 



58.527 30.801 37.100 



ATOM 2566 CE2 TYR A 309 



57.744 31.129 39.351 



ATOM 2567 CZ TYR A 309 



58.212 31.641 38.164 



AI2M 2569 PH TYR A 309 



58.300 33.004 37.966 



40 



ATOM 25 69 N GLN A 310 



60.560 26.579 40.260 



ATOM 2570 CA SLH A 310 



61.705 26.964 41.097 



AXPJ4 2571 c SLN A 310, 



63,001 26t492 40.446 



ATOM 2572 O GLN A 310 



64.009 27.191 40.442 



,00 14,30 



,00 10,76 



,00 20.29 



00 28,92 



,00 17.69 



»oo 24.03 



00 27.66 



,00 13.44 



.00 18.39 



.00 19.94 



■00 24.79 



,00 27.51 



,00 29,68 



.00 17.87 



,00 19.46 



,00 16.31 



.00 22.66 



.00 ?e,47 



.00 20,47 



.00 29.03 



■00 27.31 



00 22.89 



,00 26.06 



.00 29.89 



rQQ . 1 2,37 



,00 28,20 



.00 31,09 



.00 16-31 



-QQ 16.91 



.00 31.19 



iQO 14.57 



.00 17.51 



,QQ 24.99 



■QQ 18.41 



.00 19. 04 



■00 29.13 



.00 28.22 



00 15,41 



.00 22.35 



,00 31,46 



.00 33.q2 



ATOM 257 3 CB GLN A 310 



61.597 26,335 42.492 1.00 17.67 



45 



ATOM 2574 CG GLN A 310 



62.579 26 .921 43.461 1.00 57.58 



-2 

-£ 
-Q 

Ji 

-£ 
-2 



Ji 
Si 
Si 

Si 
Si 

Ji 

Si 

Si 
Si 

SI 
Ji 
Si 
Si 



Si 



WO 99/64618 



PCT/US99/11576 



139 

AT-OM 2575 CP QLK A 319 62.,?87 29,37p 43 ,7J?3 1.00 65 T 14 £ 

ATOM 2.57,6 PE1SLNA.31P 61 .334 28.754 44.000 1.00 41.94 £ 

ATOM 2577 NE2 GLN A 310 63.330 29.194 43.801 1.00 99 r 09 ft 

ATOM 2576 N TRP A 311 62.957 25.321 39.830 1.00 26.76 H 

5 Aign 2579 CA TRP A 311 64,146 21.822 39.163 1,90 26.29 £ 

A152J 2590 C TRP A 311 64.171 25.76? 39.P4P 1,00 17.21 £ 

AZ&U 2591 Q TRP A 311 65.599 26.193 37.860 1.00 22.89 Q 

him 2592 CP TRP A 311 63.939 23.393 39.643 l.PP 27.?? £ 

ATOM 2563 CG TRP A 311 65.176 22.764 38.119 1.00 17.62 £ 

10 Aipn 2594 CP1 TRP A 311 66.132 22,P9P 39.926 1,00 20,21 £ 

Ala* 2595 CP2 TRP A 311 65.652 22.981 36.794 1.00 17.?? £ 

ATOM 2586 NEl TRP A 311 67.197 21.776 37.992 1.00 20.39 H 

ATOM 2587 CE2 TRP A 311 66.933 22.264 36.746 1.00 19.57 £ 

ATOM 2586 CE3 TRP A 311 65.141 23.461 35.621 1.00 20.26 £ 

15 AlOU 2589 CZ2 TRP A 311 67.666 22.236 35.599 1.00 14.25 £ 

ATOM 2590 CZ3 TRP A 311 65.901 23.446 34.50 1 1.00iy r $9 £ 

ATOM 2591 CH2 TRP A 311 67.169 22.831 34. 494 1.00 16.86 £ 

ATOM 2592 P PHE A 312 63.469 26.109 37.256 1.00 17.47 H 

ATOM 2593 CA PHE A 312 63.665 27.064 36.179 1.00 20.14 £ 

20 AIPH 2594 C PHE A 312 64.224 28.371 36.733 1.00 18.33 C 

AIS2M 2595 Q PHE A 312 65.080 29.024 36.104 1.00 24.76 Q 

ATOM 2596 CB PHE A 312 62.328 27.318 35.4 58 1.00 29.51 £ 

Aiai 2597 CG PHE A 312 62.328 28.544 34.603 1.00 26.52 £ 

AXflM 2599 CPl PHE A 312 62.883 28.508 33.338 1.00 30.53 £ 

25 ATOM 2599 CP2 PHE A 312 ; 61.825 29.758 35.104 1.00 ?? r 31 C 

ATOM 2600 CE1 PHE A 312 62.936 29.660 32.554 1.00 34.73 £ 

ATOM 2601 CE2 PHE A 312 61.900 30.904 34.362 1.00 38.40 £ 

ATOM 2602 C2 PHE A 312 62.432 30.860 33.06 3 1.00 40.73 £ 

ATOM 2603 N LEU A 313 63.697 28.787 37.876 1.0Q 22.46 ft 

30 ATOM 2604 CA LEU A 313 64.170 30.025 38.516 1.00 28.47 C 

ATOM 2605 C LEU A 313 65.627 29.827 38.898 1.00 37.53 £ 

AI2M 2606 P LEV A 313 66,452 3P.693 39.629 1,00 34.20 Q 

ATOM 2607 CB LEU A 313 63.375 30.410 39.783 1 .00 20.44 C 

ATOM 2608 CG LEU A 313 61.955 30.897 39.555 1.00 16.29 £ 

35 ATOM 2609 CD1 LEU A 313 61.499 31.399 40.871 1.00 15.94 £ 

ATOM 2610 CD2 LEU A 313 61.959 31.961 38.524 1.00 14.44 £ 

ATOM 251 1 N GLU A 314 65.953 28.665 39.508 1.00 30.70 H 

ATOM 2612 CA GLU A 314 67.353 28.432 39.875 1.00 24.15 £ 

ATOM 2613 C GLU A 314 68.291 28.149 38.703 1.00 36.34 £ 

40 atom U2S o GJtV A 314 6<M95 28.047 38 T ?9P l.PP 43.10 2 

ATOM 2615 CB GLU A 314 67.459 27.366 40.947 1.00 19.90 £ 

ATOM 2616 CG GLU A 314 66.634 27.754 42.141 1.0 0 27.37 C 

ATPM 2617 CP SLV A 314 , 66,450 26.666 43.192 1.00 31..09 £ 

ATOM 2618 OEl GLU A 314 67.157 25.648 43.085 1.00 59.60 Q 

45 A12M 26 19 QE 2 SLH A 314 65 . 6 3 4 2 6.,972 44.125 1. PP.46. 20 2 
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10 



15 



25 



ATOM 2^2.9 . N 

ATOM 2621 CA 



ASN A 315 



67.779 28.114 37.479 1.00 40.17 



ASW A 315 



68.637 27.802 36.343 1.00 37.76 



Aiai ZS2i £ ASW A 315 



6B.383 28.578 35.112 1.00 43.75 



AXffll 2623 . Q ASH A 315 



68.591 28.001 34.047 1.00 39.15 



ATOM 2624 CB ASM A 3J5 



6P.425 2S,3E0 35.894 l.PP 33.74 



ATOM 2S25 ASW A 315 



69.02P ?5,3Q3 3§ y g,Pl 1,00 53. lg 



ATOM 2626 0D1 ASM A 315 



60.456 25.097 37.335 1,00 13.13 



AlffH 2627 .NP2 ASM A 315 



ATOM, 2KS W 
ATOM 2623 CA 
ATOM , 2630 C, 



GLN A ?16 



70.239 24.926 36.479 1.00 97.72 
67.852 29.803 35.197 1.00 49.87 



SIM A 316 



67.627 30.550 33.957 1.00 77.90 



sm a u& 



68.797 31.448 33.525 1.00100. 00 



ATOM 2631 P CLN A 511 



69.272 31.387 32.375 1.00 51.33 



ATOM 2632 CB GLN A 3 T 6 



66.280 31.276 33.902 1.00 75.89 



ATOl 26?3 Cg CLN A 316 



65.683 31.589 35.231 1.00 80.97 



AlfiM 2634 CP gLN A 316 



65.233 33.036 35.350 1.00 54.58 



ATOM 2635 OBI GLN A 316 



64.881 33.699 34.367 1.00 46.46 



ATOM 2636 NE2 SIB A .31.6 



65.257 33.538 36.566 1.00 33.46 
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While various embodiments of the present invention have been described in detail, 
it is apparent that modifications and adaptations of those embodiments will occur to those 
skilled in the art. However, it is to be expressly understood that such modifications and 
adaptations are within the spirit and scope of the present invention, as set forth in the 
5 following claims. 
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What is claij 

1 . A method for producing ascorbic acid or esters thereof in a microorganism, 
comprising culturing a microorganism having a genetic modification to increase the action 
of an enzyme selected from the group consisting of hexokinase, glucose phosphate 

5 isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose 
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1-P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-Y-lactone dehydrogenase; and recovering said ascorbic acid or esters thereof. 

2. A method, as claimed in Claim 1, wherein said genetic modification is a 
10 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1-P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-Y -lactone dehydrogenase. 

3. A method, as claimed in Claim 1, wherein said genetic modification is a 
15 genetic modification to increase the action of an epimerase that catalyzes conversion of 

GDP-D-mannose to GDP-L-galactose. 

4. A method, as claimed in Claim 3, wherein said genetic modification is a 
genetic modification to increase the_ a^ion^of,^^ 
epimerase. 

20 5. The method of Claim 3, wherein said genetic modification comprises 

transformation of said microorganism with a recombinant nucleic acid molecule that 
expresses said epimerase. 

6. The method of Claim 5, wherein said epimerase has a tertiary structure that 
substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 

25 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code Ibws. 

7. The method of Claim 5, wherein said epimerase has a structure having an 
average root mean square deviation of less than about 2.5 A over at least about 25% of 
Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 

30 reductase represented by atomic coordinates having Brookhaven Protein Data Bank 
Accession Code lbws. 
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8. The method of Claim 5, wherein said epimerase has a tertiary structure 
having an average root mean square deviation of less than about 1 A over at least about 
25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 

5 Bank Accession Code lbws. 

9. The method of Claim 5, wherein said epimerase comprises a substrate 
binding site having a tertiary structure that substantially conforms to the tertiary structure 
of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 
represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 

10 lbws. 

10. The method of Claim 9, wherein said substrate binding site has a tertiary 
structure with an average root mean square deviation of less than about 2.5 A over at least 
about 25% of Ca positions of the tertiary structure of a substrate binding site of a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having 

15 Brookhaven Protein Data Bank Accession Code lbws. 

1 1 . The method of Claim 5, wherein said epimerase comprises a catalytic site 
having a tertiary structure that substantially conforms to the tertiary structure of the 
catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws. 

20 12. The method of Claim 1 1, wherein said catalytic site has a tertiary structure 

with an average root mean square deviation of less than about 2.5 A over at least about 
25% of Ca positions of the tertiary structure of a catalytic site of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code lbws. 

25 13. The method of Claim 1 1 , wherein said catalytic site comprises the amino 

acid residues serine, tyrosine and lysine. 

14. The method of Claim 13, wherein tertiary structure positions of said amino 
acid residues serine, tyrosine and lysine substantially conform to tertiary structure 
positions of residues Serl07, Tyrl36 and Lysl40, respectively, as represented by atomic 

30 coordinates in Brookhaven Protein Data Bank Accession Code lbws. 

15. The method of Claim 5, wherein said epimerase binds NADPH. 
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16. The method of Claim 5, wherein said epimerase comprises an amino acid 
sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 50% of non-Xaa residues in SEQ ID NO: 11. 
5 17. The method of Claim 5, wherein said epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 75% of non-Xaa residues in SEQ ID NO: 1 1 . 

18. The method of Claim 5, wherein said epimerase comprises an amino acid 
1 0 sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 

amino acid residues in said amino acid sequence align with 100% identity with at least 
about 90% of non-Xaa residues in SEQ ID NO: 1 1 . 

19. The method of Claim 5; wherein said epimerase comprises an amino acid 
sequence having at least 4 contiguous amino acid residues that are 100% identical to at 

15 least 4 contiguous amino acid residues of an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO.8 and SEQ ID 
NO: 10. 

20. The method of Claim 5, wherein said recombinant nucleic acid molecule 
comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides 

20 having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO;9. 

21. The method of Claim 5, wherein said epimerase comprises an amino acid 
sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Qy. 

25 22. The method of Claim 5, wherein said recombinant nucleic acid molecule 

comprises a nucleic acid sequence that is at least about 15% identical to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

30 23. The method of Claim 5, wherein said recombinant nucleic acid molecule 

comprises a nucleic acid sequence that is at least about 20% identical to a nucleic acid 
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sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

24. The method of Claim 5, wherein said recombinant nucleic acid molecule 
5 comprises a nucleic acid sequence that is at least about 25% identical to a nucleic acid 

sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID N0.7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

25. The method of Claim 5, wherein said recombinant nucleic acid molecule 
10 comprises a nucleic acid sequence that hybridizes under stringent hybridization conditions 

to a nucleic acid sequence encoding a GDP-4-keto-6~deoxy-D-mannose epimerase/ 
reductase. 

26. The method of Claim 25, wherein said nucleic acid sequence encoding said 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is selected from the group 

15 consisting of SEQ ID NO: 1, SEQ ID NO:3 and SEQ ID NO:5. 

27. The method of Claim 25, wherein said GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase comprises an amino acid sequence selected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6. 

28. A method, as claimed in Claim 1, wherein said microorganism is selected 
20 from the group consisting of bacteria, fungi and microalgae. 

29. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant. 

30. A method, as claimed in Claim 1, wherein said microorganism is a 
bacterium. 

25 3 1 . A method, as claimed in Claim 30, wherein said bacterium is selected from 

the group consisting of Azotobacter and Pseudomonas. 

32. A method, as claimed in Claim 1, wherein said microorganism is a fungus. 

33. A method, as claimed in Claim 32, wherein said microorganism is a yeast. 

34. A method, as claimed in Claim 33, wherein said yeast is selected from the 
30 group consisting of Saccharomyces yeast. 
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35. A method, as claimed in Claim 1, wherein said microorganism is a 
microalga. 

36. A method, as claimed in Claim 35, wherein said microalga is selected from 
the group consisting of microalgae of the genera Prototheca and Chlorella. 

5 37. A method, as claimed in Claim 36, wherein said microalga is selected from 

the genus Prototheca 

38. A method, as claimed in Claim 1, wherein said microorganism further 
comprises a genetic modification to decrease the action of an enzyme having GDP-D- 
mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase. 
10 39. A method, as claimed in Claim 38, wherein said genetic modification to 

decrease the action of an enzyme having GDP-D-mannose as a substrate, other than GDP- 
D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP-D-mannose-dehydrogenase. 

40. A method, as claimed in Claim 1, wherein said microorganism is acid- 
15 tolerant and said step of culturing is conducted at a pH of less than about 6.0. 

41. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 5.5. 

42. A method, as claimed in Claim 1, wherein said nucrQorgMis!ELis^a.cid=_ 
tolerant and said step of culturing is conducted at a pH of less than about 5.0. 

20 43. A method, as claimed in Claim 1, wherein said step of culturing is 

conducted in a fermentation medium that is magnesium (Mg) limited. 

44. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that is Mg limited during a cell growth phase. 

45. A method, as claimed in Claim 1, wherein said step of culturing is 
25 conducted in a fermentation medium that comprises less than about 0.5 g/L of Mg during 

a cell growth phase. 

46. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that comprises less than about 0.2 g/L of Mg during 
a cell growth phase. 
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47. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that comprises less than about 0. 1 g/L of Mg during 
a cell growth phase. 

48. A method, as claimed in Claim 1, wherein said step of culturing is 
5 conducted in a fermentation medium that comprises a carbon source other than D- 

mannose. 

49. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that comprises glucose as a carbon source. 

50. A microorganism for producing ascorbic acid or esters thereof wherein 
10 said microorganism has a genetic modification to increase the action of an enzyme 

selected from the group consisting of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose-l-P-phosphatase, L-galactose dehydrogenase, and L-galactono-y-lactone 
15 dehydrogenase. 

51. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of an enzyme selected from 
the group consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1-P-phosphatase, L-galactose dehydrogenase, and L- 

20 galactono-Y-lactone dehydrogenase. 

52. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of GDP-D-mannose:GDP-L- 
galactose epimerase. 

53 . A microorganism, as claimed in Claim 50, wherein said microorganism has 
25 been genetically modified to express a recombinant nucleic acid molecule encoding an 

epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein 
said epimerase has a tertiary structure having an average root mean square deviation of 
less than about 2.5 A over at least about 25% of Ca positions of the tertiary structure of 
a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
30 coordinates having Brookhaven Protein Data Bank Accession Code lbws. 
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54. A microorganism, as claimed in Claim 50, wherein said microorganism is 
selected from the group consisting of bacteria, fungi and microalgae. 

55. A microorganism, as claimed in Claim 50, wherein said microorganism is 
a bacterium. 

5 56. A microorganism, as claimed in Claim 55, wherein said bacterium is 

selected from the group consisting of Azotobacter and Pseudomonas. 

57. A microorganism, as claimed in Claim 50, wherein said microorganism is 
a fungus. 

58. A microorganism, as claimed in Claim 57, wherein said microorganism is 

10 a yeast. 

59. A microorganism, as claimed in Claim 58, wherein said yeast is selected 
from the group consisting of Saccharomyces yeast. 

60. A plant for producing ascorbic acid or esters thereof, wherein said plant 
has a genetic modification to increase the action of an enzyme selected from the group 

i 

15 consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L- 
galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P-phosphatase, L- 
galactose dehydrogenase, and L-galactonp-v^lactQ^ 

61. A plant, as claimed in Claim 60, wherein said genetic modification is a 
20 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-P-mannose:Gt)P-L-gaIactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-pho^}hatase, L-galactose dehydrogenase, and L- 
galactono-Y -lactone dehydrogenase. 

62. A plant, as claimed in Claim 60, wherein said genetic modification is a 
25 genetic modification to increase the action of GDP-D-mannose: GDP-L-galactose 

epimerase. 

63. A plant, as claimed in Claim 60, wherein said plant has been genetically 
modified to express a recombinant nucleic acid molecule encoding an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase 

30 has a tertiary structure having an average root mean square deviation of less than about 
2.5 A over at least about 25% of Ca positions of the tertiaiy structure of a GDP-4-keto-6- 
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deoxy-D-mannose epimerase/reductase represented by atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws. 

64. A plant, as claimed in Claim 60, wherein said plant further comprises a 
genetic modification to decrease the action of an enzyme having GDP-D-mannose as a 

5 substrate other than GDP-D-mannose:GDP-L-galactose epimerase. 

65. A plant, as claimed in Claim 60, wherein said genetic modification to 
decrease the action of an enzyme having GDP-D-mannose as a substrate other than GDP- 
D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP-D-mannose-dehydrogenase. 

10 66. A plant, as claimed in Claim 60, wherein said plant is a microalga. 

67. A plant, as claimed in Claim 66, wherein said plant is selected from the 
group consisting of microalgae of the genera Prototheca and Chlorella. 

68. A plant, as claimed in Claim 66, wherein said microalga is selected from 
the genus Prototheca. 

15 69. A plant, as claimed in Claim 60, wherein said plant is a higher plant. 

70. A plant, as claimed in Claim 60, wherein said plant is a consumable higher 

plant. 

71. A microorganism for producing ascorbic acid or esters thereof, wherein 
said microorganism has been genetically modified to express a recombinant nucleic acid 

20 molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ 
ID NO: 11 using a CLUSTAL alignment program, wherein amino acid residues in said 
amino acid sequence align with 100% identity with at least about 50% of non-Xaa 
residues in SEQ ID NO : 1 1 . 

25 72. A plant for producing ascorbic acid or esters thereof, wherein said plant 

has been genetically modified to express a recombinant nucleic acid molecule encoding 
an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein 
said epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 1 1 using 
a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence 

30 align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 1 1 . 
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SEQUENCE LISTING 
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<110> Berry, Alan 

Running, Jeffrey A, 
Sever son, David K. 
Burlingame, Richard P. 

<120> "VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS" 

<130> 3161-24-PCT 

<140> not yet assigned 
<141> 1999-05-25 

<150> 60/125,073 
<151> 1999-03-17 

<150> 60/125,054 
<151> 1999-03-18 

<150> 60/088,549 
<151> 1998-06-08 

<160> 15 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 1583 
^212^^NA 

<213> Arabidopsis thaliana 




<220> 

<221> CDS 

<222> (49) . . (993) 



<400> 1 

tagtctttaa tttcgcagcg tttttataat tgtgcagagg tttcgtcc atg tct gac 

Met Ser Asp 
1 



57 



aaa tct gcc aaa ate ttc gtc gcg ggt cat cgt ggt ttg gtt gga tct 
Lys Ser Ala Lys lie Phe Val Ala Gly His Arg Gly Leu Val Gly Ser 
5 10 15 



105 , 



gcc att gtc cgc aag ctt cag gaa caa ggt ttc acc aat etc gtt ctt 
Ala He Val Arg Lys Leu Gin Glu Gin Gly Phe Thr Asn Leu Val Leu 
20 25 30 35 



153 



1 
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aaa aca cac gee gag ctt gat etc act cgt caa gec gat gtt gaa tec 201 
Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gin Ala Asp Val Glu Ser 

40 45 50 

ttc ttt tct caa gag aag cca gtt tat gta ate eta gca gca get aaa 249 
Phe Phe Ser Gin Glu Lys Pro Val Tyr Val He Leu Ala Ala Ala Lys 

55 60 65 

gtt ggt ggt att cac get aac aac ace tat cct get gat ttc att ggt 297 
Val Gly Gly He His Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie Gly 
70 75 80 

gtc aat etc cag att cag acc aat gtg ate cac tct gca tat gag cac 345 
Val Asn Leu Gin He Gin Thr Asn Val He His Ser Ala Tyr Glu His 
85 90 95 

ggt gtg aag aag ctt etc ttc ctt gga tea tec tgc att tac cct aaa 393 
Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys He Tyr Pro Lys 
100 105 110 115 

ttt get cct cag cca att cct gag tct get ttg tta aca gca teg ctt 441 
Phe Ala Pro Gin Pro He Pro Glu Ser Ala Leu Leu Thr Ala Ser Leu 

120 125 130 

gaa cca act aat gag tgg tat get att get aag ate get ggg att aag 489 
Glu Pro Thr Asn Glu Trp Tyr Ala He Ala Lys He Ala Gly He Lys 

135 140 145 

act tgt cag get tat agg att cag cac gga tgg gat gca ate tct ggc 537 
Thr Cys Gin Ala Tyr Arg lie Gin His Gly Trp Asp Ala He Ser Gly 
150 155 160 

atg cct act aat etc tat ggt cct aat gac aat ttc cac ccg gag tct 585 
Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His Pro Glu Ser 
165 170 175 

cat gtg ctt cct get ctt atg agg agg ttc cac gag gcg aaa gtg aat 633 
His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala Lys Val Asn 
180 185 190 195 

tgg age gga gga agt tgt ggt gtg ggg tac aag gta gtc ccg ttg gaa 681 
Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val Pro Leu Glu 

200 205 210 

ggg aag ttc ttg cat gtt gat gat ttg get gat get tgt gtt ttc ttg 729 
Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys Val Phe Leu 

215 22t) 225 
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ctg gat egg ata cag egg ggg ttg gag cat gtt aac att gga agt ggt 
Leu Asp Arg He Gin Arg Gly Leu Glu His Val Asn He Gly Ser Gly 
230 235 240 



777 



caa gaa gtg act att aga gag ttg get gag ttg gtg aaa gag gtt gtt 
Gin Glu Val Thr He Arg Glu Leu Ala Glu Leu Val Lys Glu Val Val 
245 250 255 



825 



ggt ttt gaa ggg aag ctt gga tgg gat tgc act aag cca gat ggc aca 
Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro Asp Gly Thr 
260 265 270 275 



873 



ccg agg aaa ctt atg gac age tea aag etc gcg tct ttg ggt tgg aca 
Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu Gly Trp Thr 

280 285 290 



921 



cct aag gtt tct ctt aga gat ggt ctg age caa act tat gat tgg tat 
Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gin Thr Tyr Asp Trp Tyr 

295 300 305 



969 



ttg aag aat gtt tgc aac cga taa gttaatggtt tctcttctca tatatacaca 1023 
Leu Lys Asn Val Cys Asn Arg 

310 315 

actattgagt ctcaggtaaa tcagcttatc accacattgt gatttaaacc tttctttgag 1083 

attcgagaat tgcttttttt tttatcaaaa ttgattcatt tagagataag acttgettet 1143 

tctcttctat acaaattatc aaaacagttg taagaagttt caagaaaaac atttgatatc 1263 

tcactaattt ggctatcctt geaagttgea aegctaaaat gacaaataat gaattctegg 1323 

cccaatgggc ttacacaagc cttgttaaag atagcgtgaa caaaacgegg ctcactagcc 1383 

ctaacctgtc tctctttcgc ttaccttctt cttcgtcttc gttggctcag tcacttgact 1443 

tcacggcccg ctcaagctct gacacgaaac tcatttcaaa ttaatttaat aaaaccttaa 1503 

tcacaaaagg ggcaaaagca atcgccggcg attatgeett ctcctccggt geeggagacg 1563 



gttgtgagcc aacccgttcg 



1583 



<210> 2 
<211> 314 
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<213> Arabidopsis thaliana 
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<400> 2 

Met Ser Asp Lys 
1 

Val Gly Ser Ala 

20 

Leu Val Leu Lys 
35 

Val Glu Ser Phe 
50 



Ser Ala Lys lie 
5 

He Val Arg Lys 



Thr His Ala Glu 

40 

Phe Ser Gin Glu 
55 



Phe Val Ala Gly 
10 

Leu Gin Glu Gin 
25 

Leu Asp Leu Thr 



Lys Pro Val Tyr 

60 



His Arg Gly Leu 
15 

Gly Phe Thr Asn 
30 

Arg Gin Ala Asp 
45 

Val He Leu Ala 



Ala Ala Lys Val Gly Gly He His 

65 .70 

i 

Phe He Gly Val Asn Leu Gin He 

85 

Tyr Glu His Gly Val Lys Lys Leu 

100 

Tyr Pro Lys Phe Ala Pro Gin Pro 
115 120 

Ala Ser Leu Glu Pro Thr Asn Glu 
130 135 

Gly He Lys Thr Cys Gin Ala Tyr 
145 150 



Ala Asn Asn Thr Tyr Pro Ala Asp 
75 80 

Gin Thr Asn Val lie His Ser Ala 
90 95 

Leu Phe Leu Gly Ser Ser Cys He 
105 110 

He Pro Glu Ser Ala Leu Leu Thr 

125 

Trp Tyr Ala lie Ala Lys lie Ala 

140 

Arg lie Gin His Gly Trp Asp Ala 
155 160 



lie Ser Gly Met 



Pro Glu Ser His 

180 



Pro Thr Asn Leu 
165 

Val Leu Pro Ala 



Tyr Gly Pro Asn 
170 

Leu Met Arg Arg 
185 



Asp Asn Phe His 
175 

Phe His Glu Ala 
190 



Lys Val Asn Trp Ser 
195 

Pro Leu Glu Gly Lys 
210 

Val Phe Leu Leu Asp 
225 



Gly Gly Ser Cys Gly Val 
200 

Phe Leu His Val Asp Asp 
215 

Arg lie Gin Arg Gly Leu 

730 235 



Gly Tyr Lys Val Val 
205 

Leu Ala Asp Ala Cys 
220 

Glu His Val Asn He 

240 
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Gly Ser Gly Gin Glu Val Thr lie Arg Glu Leu Ala Glu Leu Val Lys 

245 250 255 

Glu val Val Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro 

260 265 270 

Asp Gly Thr Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu 
275 280 285 

Gly Trp Thr Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gin Thr Tyr 
290 295 300 

Asp Trp Tyr Leu Lys Asn Val Cys Asn Arg 
305 310 



<210> 3 
<211> 966 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (966) 



<400> 3 

atg agt aaa caa cga gtt ttt att get ggt cat cgc ggg atg gtc ggt 48 
Met Ser Lys Gin Arg Val Phe He Ala Gly His Arg Gly Met Val Gly 




tec gec ate agg egg cag etc gaa cag cgc ggt gat gtg gaa ctg gta 96 
Ser Ala He Arg Arg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 

20 25 30 



tta cgc acc cgc gac gag ctg aac ctg ctg gac age cgc gec gtg cat 144 
Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His 
35 40 4d 

gat ttc ttt gec age gaa cgt att gac cag gtc tat ctg gcg gcg gcg 192 
Asp Phe Phe Ala Ser Glu Arg He Asp Gin Val Tyr Leu Ala Ala Ala 
50 55 60 

aaa gtg ggc ggc att gtt gec aac aac acc tat ccg gcg gat ttc ate 240 
Lys Val Gly Gly He Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe He 
65 70 75 80 

tac cag aac atg atg att gag age aac ate att cac gee gcg cat cag 288 



5 
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Tyr Gin Asn Met 



Met He Glu Ser Asn 
85 
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He He His Ala Ala His Gin 
90 95 



aac gac gtg aac aaa ctg 
Asn Asp Val Asn Lys Leu 

100 

aaa ctg gca aaa cag ccg 
Lys Leu Ala Lys Gin Pro 
115 

ctg gag ccg act aac gag 
Leu Glu Pro Thr Asn Glu 
130 

aaa ctg tgc gaa tea tac 
Lys Leu Cys Glu Ser Tyr 
145 150 



ctg ttt etc gga teg tec 
Leu Phe Leu Gly Ser Ser 
105 

atg gca gaa age gag ttg 
Met Ala Glu Ser Glu Leu 
120 

cct tat get att gee aaa 
Pro Tyr Ala He Ala Lys 
135 140 

aac cgc cag tac gga cgc 
Asn Arg Gin Tyr Gly Arg 

155 



tgc ate tac ccg 336 
Cys He Tyr Pro 
110 

ttg cag ggc acg 384 

Leu Gin Gly Thr 

125 

ate gec ggg ate 432 
He Ala Gly He 

gat tac cgc tea 480 
Asp Tyr Arg Ser 

160 



gtc atg ccg ace 
Val Met Pro Thr 



aat teg cat gtg 
Asn Ser His Val 

180 

gca cag aat gcg 
Ala Gin Asn Ala 
195 

cgc gaa ttt ctg 
Arg Glu Phe Leu 
210 

atg gag ctg gcg 
Met Glu Leu Ala 
225 



aac ctg tac 
Asn Leu Tyr 
165 

ate cca gca 
He Pro Ala 



ccg gac gtg 
Pro Asp Val 



cac gtc gat 
His Val Asp 
215 

cat gaa gtc 
His Glu Val 
230 



ggg cca cac 
Gly Pro His 
170 

ttg ctg cgt 
Leu Leu Arg 
185 

gtg gta tgg 
Val Val Trp 
200 

gat atg gcg 
Asp Met Ala 



tgg ctg gag 
Trp Leu Glu 



gac aac ttc cac 
Asp Asn Phe His 

cgc ttc cac gag 
Arg Phe His Glu 

190 

ggc age ggt aca 
Gly Ser Gly Thr 
205 

gcg gcg age att 
Ala Ala Ser He 
220 

aac ace cag ccg 
Asn Thr Gin Pro 
235 



ccg agt 528 

Pro Ser 

175 

gcg acg 576 
Ala Thr 



ccg atg 624 
Pro Met 



cat gtc 672 
His Val 



atg ttg 720 
Met Leu 
240 



teg cac att aac 
Ser His He Asn 



gcg caa acc ate 
Ala Gin Thr He 

260 

gat gec age aaa 



gtc ggc acg ggc gtt 
Val Gly Thr Gly Val 
245 

gee aaa gtg gtg ggt 
Ala Lys Val Val Gly 

265 

ccg gat ggc acg ccg 



gac tgc act ate 
Asp Cys Thr He 
250 

tac aaa ggc egg 
Tyr Lys Gly Arg 

cgc aaa ctg ctg 



cgc gac gtg 768 
Arg Asp Val 
255 

gtg gtt ttt 816 

Val Val Phe 

270 

gat gtg acg 864 



6 
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Asp Ala ser Lys Pro Asp 
275 

cgc ctg cat cag ctt ggc 
Arg Leu His Gin Leu Gly 
290 

ctt gcc age act tac cag 
Leu Ala Ser Thr Tyr Gin 
305 310 

ggg taa 
Gly 



Gly Thr Pro Arg Lys Leu 
280 

tgg tat cac gaa ate tea 
Trp Tyr His Glu lie Ser 
295 300 

tgg ttc ctt gag aat caa 
Trp Phe Leu Glu Asn Gin 

315 



PCT/US99/11576 

Leu Asp Val Thr 
285 

ctg gaa gcg ggg 912 
L u Glu Ala Gly 

gac cgc ttt egg 960 
Asp Arg Phe Arg 

320 

966 



<210> 4 
<211> 321 
<212> PRT 

<213> Escherichia coli 



<400> 4 

Met Ser Lys Gin Arg Val Phe lie Ala Gly His Arg Gly Met Val Gly 
15 10 15 

Ser Ala lie Arg Arg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 

20 25 30 



Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His 
35 40 45 




*Asp Phe pne Ala ser Glu Arg lie Asp Gin Val Tyr Leu Ala Ala Ala 
50 55 60 



Lys Val Gly Gly lie Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe. lie 
65 70 75 " 80 

Tyr Gin Asn Met Met lie Glu Ser Asn lie lie His Ala Ala His Gin 

85 90 95 



Asn Asp Val Asn Lys Leu Leu Phe 

100 

Lys Leu Ala Lys Gin Pro Met Ala 

115 120 

Leu Glu Pro Thr Asn Glu Pro Tyr 

130 135 



Leu Gly Ser Ser Cys lie Tyr Pro 
105 110 

Glu Ser Glu Leu Leu Gin Gly Thr 

125 

Ala He Ala Lys He Ala Gly He 

140 



7 
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Lys Leu Cys Glu Ser Tyr Asn Arg Gin Tyr Gly Arg Asp Tyr Arg s r 
145 150 155 160 

Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser 

165 170 175 

Asn Ser His Val He Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr 

180 185 190 

Ala Gin Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met 
195 200 205 

Arg Glu Phe Leu His Val Asp Asp Met Ala Ala Ala Ser lie His Val 
210 215 220 

Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gin Pro Met Leu 
225 230 235 240 

Ser His He Asn Val Gly Thr Gly Val Asp Cys Thr He Arg Asp Val 

245 250 255 

Ala Gin Thr He Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe 

260 265 270 

Asp Ala Ser Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp Val Thr 
275 280 285 

Arg Leu His Gin Leu Gly Trp Tyr His Glu He Ser Leu Glu Ala Gly 
290 295 300 

Leu Ala Ser Thr Tyr Gin Trp Phe Leu Glu Asn Gin Asp Arg Phe Arg 
305 310 315 320 

Gly 



<210> 5 

<211> 1340 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (75) . . (1040) 

<400> 5 

ctagaattca gcggccgctg aattctagct agaattcagc ggccgctgaa ttctagaacc 60 

8 
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caggtgcaac tgac atg ggt gaa ccc cag gga tec atg egg att eta gtg 

Met Gly Glu Pro Gin Gly Ser Met Arg lie Leu Val 
15 10 



110 



aca ggg ggc tct ggg ctg gta ggc aaa gec ate cag aag gtg gta gca 
Thr Gly Gly Ser Gly Leu Val Gly Lys Ala lie Gin Lys Val Val Ala 
15 20 25 



158 



gat gga get gga ctt cct gga gag gac tgg gtg ttt gtc tec tct aaa 
Asp Gly Ala Gly Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys 
30 35 40 



206 



gac gee gat etc acg gat aca gca cag ace cgc gec ctg ttt gag aag 
Asp Ala Asp Leu Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys 
45 50 55 60 



254 



gtc caa ccc aca cac gtc ate cat ctt get gca atg gtg ggg ggc ctg 
Val Gin Pro Thr His Val lie His Leu Ala Ala Met Val Gly Gly Leu 

65 70 75 



302 



ttc egg aat ate aaa tac aat ttg gac ttc tgg agg aaa aac gtg cac 
Phe Arg Asn lie Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His 

80 85 90 



350 



atg aac gac aac gtc ctg cac teg gee ttt gag gtg ggg gec cgc aag 
Met Asn Asp Asn Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys 
95 100 105 



398 



gtg gtg tec tgc ctg tec acc tgt ate ttc cct gac aag acg acc tac 
^ y ^ a ^T^^^^^^^^^^^nb'^^^ ' 120 



446 



ccg ata gat gag acc atg ate cac aat ggg cct ccc cac aac age aat 
Pro lie Asp Glu Thr Met He His Asn Gly Pro Pro His Asn Ser Asn 
125 130 135 140 



ttt ggg tac teg tat gec aag agg atg ate gac gtg cag aac 
Phf* C±m Tvr Ser Tvr Ala T,v«; 2Xrrr Met Tie Asd Val Gin Asn 



Phe Gxy Tyr Ser Tyr Al 

145 



Lys Arg Met He Asp Val Gin Asn Arg Al 

150 1tc 



agg gec 
a 

155 



494 



542 



tac ttc cag cag tac ggc tgc acc ttc acc get gtc ate ccc acc aac 
Tyr Phe Gin Gin Tyr Gly Cys Thr Phe Thr Ala Val He Pro Thr Asn 

160 165 170 



590 



gtt ttc ggg ccc cac gac aac ttc aac ate gag gat ggc cac gtg ctg 
Val Phe Gly Pro His Asp Asn Phe Asn He Glu Asp Gly His Val Leu 
175 180 185 



638 



9 
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cct ggc etc ate cac aag gtg cac ctg gec aag age age ggc teg gee 686 
Pro Gly Leu lie His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala 
190 195 200 

ctg acg gtg tgg ggt aca ggg aat ccg egg agg cag ttc ata tac teg 734 
Leu Thr Val Trp Gly Thr Gly Asn Pro Arg Arg Gin Phe lie Tyr Ser 
205 210 215 220 

ctg gac ctg gee cag etc ttt ate tgg gtc ctg egg gag tac aat gaa 782 
Leu Asp Leu Ala Gin Leu Phe lie Trp Val Leu Arg Glu Tyr Asn Glu 

225 230 235 

gtg gag ccc ate ate etc tec gtg ggc gag gaa gat gag gtc tec ate 830 
Val Glu Pro He He Leu Ser Val Gly Glu Glu Asp Glu Val Ser He 

240 245 250 

aag gag gca gee gag gcg gtg gtg gag gee atg gac ttc cat ggg gaa 878 
Lys Glu Ala Ala Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu 
255 260 265 

gtc ace ttt gat aca acc aag teg gat ggg cag ttt aag aag aca gee 926 
Val Thr Phe Asp Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys Thr Ala 
270 275 280 

agt aac age aag ctg agg acc tac ctg ccc gac ttc egg ttc aca ccc 974 
Ser Asn Ser Lys Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro 
285 290 295 300 

ttc aag cag gcg gtg aag gag acc tgt get tgg ttc act gac aac tac 1022 
Phe Lys Gin Ala Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr 

305 310 315 

gag cag gec egg aag tga agctggaaga caggatcagg tgccagcgga 1070 
Glu Gin Ala Arg Lys 

320 

ccatcggctg gcagagccca gcggccacca cccgtcaacc ctgecaggag ctgagggcac 1130 

cacccagcaa cctgggcctg cattccatcc gctctgcagc cccaagcatc tttccagtgg 1190 

ggcccccatt cacgttggtc ctcagggaaa ccagggtccg gggcaggccc ggcgctttgc 1250 

tccccacacc agccccctgc gcgtgtccac tctgatcctg catcccactc cctgggagcc 1310 

aataaagtgc attttcacag aaaaaaaaaa 1340 



<210> 6 

10 



NSDOCID: <W0 99646 18Al_l_> 



wu yy/oioio 

<211> 321 
<212> PRT 
<213> Homo sapiens 



PCT/US99/11576 



<400> 6 

Met Gly Glu Pro Gin Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser 
15 10 15 

Gly Leu Val Gly Lys Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly 

20 25 30 

Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
35 40 45 

Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
50 55 60 

His Val lie His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn He 
65 70 75 80 

Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 

85 **90 95 

Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 

100 105 110 

Leu ser Thr Cys He Phe Pro Asp Lys Thr Thr Tyr Pro He Asp Glu 
115 120 125 

Thr Met He His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
Tyr Ala Lys Arg Met He Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin 



145 



150 



155 



160 



Tyr Gly Cys Thr Phe Thr Ala Val He Pro Thr Asn Val Phe Gly Pro 

165 170 175 

His Asp Asn Phe Asn lie Glu Asp Gly His Val Leu Pro Gly Leu He 

180 185 190 

His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp 
195 200 205 



Gly Thr Gly Asn Pro Arg Arg Gin Phe He Tyr Ser Leu Asp Leu Ala 
210 215 220 



Gin Leu Phe He Trp Val Leu Arg 

11 



Glu Tyr Asn Glu Val Glu Pro He 



WO 99/64618 PCT/US99/11576 

225 230 235 240 



He Leu Ser Val Gly Glu Glu Asp 

245 

Glu Ala Val Val Glu Ala Met Asp 

260 

Thr Thr Lys Ser Asp Gly Gin Phe 

275 280 

Leu Arg Thr Tyr Leu Pro Asp Phe 
290 295 

Val Lys Glu Thr Cys Ala Trp Phe 
305 310 

Lys 



Glu Val Ser He Lys Glu Ala Ala 
250 255 

Phe His Gly Glu Val Thr Phe Asp 
265 270 

Lys Lys Thr Ala Ser Asn Ser Lys 

285 

Arg Phe Thr Pro Phe Lys Gin Ala 

300 

Thr Asp Asn Tyr Glu Gin Ala Arg 
315 320 



<210> 7 
<211> 1017 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1017) 

<400> 7 

atg aga gtt ctg gtt acc ggt ggt age ggt tac att gga agt cat acc 4 8 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr He Gly Ser His Thr 
15 10 15 

tgt gtg caa tta ctg caa aac ggt cat gat gtc ate att ctt gat aac 96 
Cys Val Gin Leu Leu Gin Asn Gly His Asp Val He He Leu Asp Asn 

20 25 30 

etc tgt aac agt aag cgc age gta ctg cct gtt ate gag cgt tta ggc 144 
Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val He Glu Arg Leu Gly 
35 40 45 

ggc aaa cat cca acg ttt gtt gaa ggc gat att cgt aac gaa gcg ttg 192 
Gly Lys His Pro Thr Phe Val Glu Gly Asp He Arg Asn Glu Ala Leu 
50 55 60 

atg acc gag ate ctg cac gat cac get ate gac acc gtg ate cac ttc 240 
Met Thr Glu He Leu His Asp His Ala lie Asp Thr Val He His Phe 

12 
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65 "70 75 80 

gcc ggg ctg aaa gcc gtg ggc gaa teg gta caa aaa ccg ctg gaa tat 288 

Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gin Lys Pro Leu Glu Tyr 

85 90 95 

tac gac aac aat gtc aac ggc act ctg cgc ctg att age gcc atg cgc 336 

Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu lie Ser Ala Met Arg 

100 105 110 

gcc get aac gtc aaa aac ttt att ttt age tec tec gcc ace gtt tat 384 

Ala Ala Asn Val Lys Asn Phe lie Phe Ser Ser Ser Ala Thr Val Tyr 

115 120 125 

ggc gat cag ccc aaa att cca tac gtt gaa age ttc ccg ace ggc aca 432 

Gly Asp Gin Pro Lys He Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 

130 135 140 



ccg caa age cct tac ggc aaa age aag ctg atg gtg gaa cag ate etc 
Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gin He Leu 
145 150 155 160 



480 



acc gat ctg caa aaa gcc cag ccg gac tgg age att gcc ctg ctg cgc 
Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 

165 170 175 



528 



tac ttc aac ccg gtt ggc gcg cat ccg teg ggc gat atg ggc gaa gat 
Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 

180 185 190 



576 




ccg caa gge att ccg aat aac ctg atg cca tac ate gcc cag gtt get 
Pro Gin Gly He Pro Asn Asn Leu Met Pro Tyr He Ala Gin Val Ala 
195 200 205 



624 



gta ggc cgt cgc gac teg ctg gcg att ttt ggt aac gat tat ccg acc 
Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 



672 



gaa gat ggt act ggc gta cgc gat tac ate cac gta atg gat ctg gcg 
Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 



720 



gac ggt cac gtc gtg gcg atg gaa aaa ctg gcg aac aag cca ggc gta 
Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val 

245 250 255 



768 



cac ate tac aac etc ggc get ggc gta ggc aac age gtg ctg gac gtg 
His He Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 



816 



13 
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260 

gtt aat gcc ttc age aaa 
Val Asn Ala Phe Ser Lys 
275 

gca ccg cgt cgc gag ggc 
Ala Pro Arg Arg Glu Gly 
290 



265 

gcc tgc ggc aaa ccg gtt 
Ala Cys Gly Lys Pro Val 
280 

gac ctt ccg gcc tac tgg 
Asp Leu Pro Ala Tyr Trp 
295 300 



PCT/US99/11576 

270 

aat tat cat ttt 864 

Asn Tyr His Phe 

285 

gcg gac gcc age 912 

Ala Asp Ala Ser 



aaa gcc gac cgt gaa ctg aac tgg cgc 
Lys Ala Asp Arg Glu Leu Asn Trp Arg 
305 310 

atg gcg cag gac acc tgg cac tgg cag 
Met Ala Gin Asp Thr Trp His Trp Gin 

325 

ccc gat taa 
Pro Asp 



gta acg cgc aca etc gat gaa 960 
Val Thr Arg Thr Leu Asp Glu 
315 320 

tea cgc cat cca cag gga tat 1008 
Ser Arg His Pro Gin Gly Tyr 
330 335 

1017 



<210> 8 
<211> 338 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr lie Gly Ser His Thr 
15 10 15 

Cys Val Gin Leu Leu Gin Asn Gly His Asp Val lie lie Leu Asp Asn 

20 25 30 

Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val He Glu Arg Leu Gly 
35 40 45 

Gly Lys His Pro Thr Phe Val Glu Gly Asp He Arg Asn Glu Ala Leu 
50 55 60 

Met Thr Glu He Leu His Asp His Ala He Asp Thr Val He His Phe 
65 70 75 80 

Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gin Lys Pro Leu Glu Tyr 

85 90 95 

Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu He Ser Ala Met Arg 

100 105 110 

14 
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Ala Ala Asn Val Lys Asn Phe lie Phe Ser Ser Ser Ala Thr Val Tyr 
115 120' 125 

Gly Asp Gin Pro Lys lie Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 
130 135 140 

Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gin He Leu 
145 150 155 160 

Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 

165 170 175 

Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 

180 185 190 

Pro Gin Gly He Pro Asn Asn Leu Met Pro Tyr He Ala Gin Val Ala 

195 200 /> 205 

Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 

» - 

Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 

Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val 

245 250 255 

His He Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 

26^^^^^^^^^^26^^^^^^^^^^270 

Val Asn Ala Phe Ser Lys Ala Cys Gly Lys Pro Val Asn Tyr His Phe 
275 280 285 

Ala Pro Arg Arg Glu Gly Asp Leu Pro Ala Tyr Trp Ala Asp Ala Ser 
290 295 % 300 

■ r .- ■ 

Lys Ala Asp Arg Glu Leu Asn Trp Arg Val Thr Arg Thr Leu Asp Glu 
305 310 315 320 

Met Ala Gin Asp Thr Trp His Trp Gin Ser Arg His Pro Gin Gly Tyr 

325 330 335 



Pro Asp 



<210> 9 
<211> 1047 
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<212> DMA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1047) 

<400> 9 

atg gca gag aag gtg ctg gta aca ggt ggg get ggc tac att ggc age 48 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
15 10 15 

cac acg gtg ctg gag ctg ctg gag get ggc tac ttg cct gtg gtc ate 96 
His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val lie 

20 25 30 



gat aac ttc cat 
Asp Asn Phe His 
35 

ctg egg egg gtc 
Leu Arg Arg Val 
50 

atg gac att ttg 
Met Asp lie Leu 
65 

age ttt atg gcg 
Ser Phe Met Ala 



teg gtg cag aag 
Ser Val Gin Lys 

100 

ate cag ctt ctg 
He Gin Leu Leu 
115 

ttc age age tea 
Phe Ser Ser Ser 
130 

gat gag gee cac 
Asp Glu Ala His 
145 



aat gee ttc 
Asn Ala Phe 



cag gag ctg 
Gin Glu Leu 
55 

gac cag gga 
Asp Gin Gly 
70 

gtc ate cac 
Val lie His 
85 

cct ctg gat 
Pro Leu Asp 

gag ate atg 
Glu He Met 



gee act gtg 
Ala Thr Val 
135 

ccc acg ggt 
Pro Thr Gly 
150 



cgt gga ggg 
Arg Gly Gly 
40 

aca ggc cgc 
Thr Gly Arg 

gee eta cag 
Ala Leu Gin 

ttt gcg ggg 
Phe Ala Gly 
90 

tat tac aga 
Tyr Tyr Arg 
105 

aag gee cac 
Lys Ala His 
120 

tac ggg aac 
Tyr Gly Asn 

ggt tgt ace 
Gly Cys Thr 



ggc tec ctg cct 
Gly Ser Leu Pro 
45 

tct gtg gag ttt 
Ser Val Glu Phe 
60 

cgt etc ttc aaa 
Arg Leu Phe Lys 
75 

etc aag gee gtg 
Leu Lys Ala Val 



gtt aac ctg ace 
Val Asn Leu Thr 

110 

ggg gtg aag aac 
Gly Val Lys Asn 
125 

ccc cag tac ctg 
Pro Gin Tyr Leu 
140 

aac cct tac ggc 
Asn Pro Tyr Gly 
155 



gag age 144 
Glu Ser 



gag gag 192 
Glu Glu 

aag tac 240 
Lys Tyr 
80 

ggc gag 288 
Gly Glu 
95 

ggg acc 336 
Gly Thr 



ctg gtg 384 
Leu Val 

ccc ctt 432 

t 

Pro Leu 

aag tec 480 
Lys Ser 
160 



16 
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aag ttc ttc ate gag gaa atg ate egg gac ctg tgc cag gca gac aag 
Lys Phe Phe II Glu Glu Met He Arg Asp Leu Cys Gin Ala Asp Lys 

165 170 175 



528 



act tgg aac gta gtg ctg ctg cgc tat ttc aac ccc aca ggt gec cat 
Thr Trp Asn Val Val Leu Leu Arg Tyr Ph Asn Pro Thr Gly Ala His 

180 185 190 



576 



gec tct ggc tgc att ggt gag gat ccc cag ggc ata ccc aac aac etc 
Ala Ser Gly Cys He Gly Glu Asp Pro Gin Gly He Pro Asn Asn Leu 
195 200 205 



624 



atg cct tat gtc tec cag gtg gcg ate ggg cga egg gag gee ctg aat 
Met Pro Tyr Val Ser Gin Val Ala He Gly Arg Arg Glu Ala Leu Asn 
210 215 220 



672 



gtc ttt ggc aat gac tat gac aca gag gat ggc aca ggt gtc egg gat 
Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 



720 



tac ate cat gtc gtg gat ctg gec aag ggc cac att gca gee tta agg 
Tyr He His Val Val Asp Leu Ala Lys Gly His He Ala Ala Leu Arg 

245 250 255 



768 



aag ctg aaa gaa cag tgt ggc tgc egg ate tac aac ctg ggc acg ggc 
Lys Leu Lys Glu Gin Cys Gly Cys Arg He Tyr Asn Leu Gly Thr Gly 

260 265 270 



816 



aca ggc tat tea gtg ctg cag atg gtc cag get atg gag aag gec tct 
Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala Ser 



864 



ggg aag aag ate ccg tac aag gtg gtg gca egg egg gaa ggt gat gtg 
Gly Lys Lys He Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 



912 



gca gec tgt tac gec aac ccc age ctg gee caa gag gag ctg ggg tgg 
Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 



960 



aca gca gec tta ggg ctg gac agg atg tgt gag gat etc tgg cgc tgg 
Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp 

325 330 335 



1008 



cag aag cag aat cct tea ggc ttt ggc acg caa gee tga 
Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 

340 345 



1047 
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<210> 10 
<211> 348 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
1 5 10 15 

His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val lie 

20 25 30 

Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser 
35 40 45 

Leu Arg Arg Val Gin Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu 
50 55 60 

Met Asp He Leu Asp Gin Gly Ala Leu Gin Arg Leu Phe Lys Lys Tyr 
65 70 75 80 

Ser Phe Met Ala Val He His Phe Ala Gly Leu Lys Ala Val Gly Glu 

85 90 95 

Ser Val Gin Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr 

100 105 110 

He Gin Leu Leu Glu He Met Lys Ala His Gly Val Lys Asn Leu Val 
115 120 125 

Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gin Tyr Leu Pro Leu 
130 135 140 

Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser 
145 150 155 160 

Lyt. Phe Phe He Glu Glu Met He Arg Asp Leu Cys Gin Ala Asp Lys 

165 170 175 

Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His 

180 185 190 

Ala Ser Gly Cys He Gly Glu Asp Pro Gin Gly He Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr Val Ser Gin Val Ala He Gly Arg Arg Glu Ala Leu Asn 
210 215 220 

18 
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Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 

Tyr He His Val Val Asp Leu Ala Lys Gly His He Ala Ala Leu Arg 

245 250 255 

Lys Leu Lys Glu Gin Cys Gly Cys Arg He Tyr Asn Leu Gly Thr Gly 

260 265 270 

Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala Ser 
275 280 265 

Gly Lys Lys He Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 

Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 

Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp 

325 3J0 335 

Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 

340 345 



<210> 11 
<211> 317 
<212> PRT 
<2 1 3 *-Ar.ti f-Lci-a 1 




<220> 

<223> Description of Artificial Sequence : CONSENSUS 



<400> 11 

Xaa Xaa Arg Xaa Xaa Xaa Xaa Gly Xaa Xali Gly Xaa Xaa Gly Xaa Xaa 
15 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Ala Xaa Xaa Xaa 
50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



19 
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65 70 75 80 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Pro Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 



Xaa Xaa Asn Xaa Xaa 

165 

Xaa Xaa Xaa Pro Xaa 

180 

Xaa Xaa Xaa Xaa Xaa 
195 

Xaa Xaa Xaa Xaa Xaa 
210 

Xaa Xaa Xaa Xaa Xaa 
225 

Xaa Xaa Xaa Gly Xaa 

245 

Xaa Xaa Xaa Xaa Xaa 

260 



Gly Xaa His Xaa Xaa Xaa 

170 

Xaa Xaa xaa Xaa Xaa Xaa 

185 

Xaa Xaa Xaa Gly Xaa Gly 
200 

Asp Xaa Ala Xaa Xaa Xaa 
215 

Xaa Xaa Xaa Xaa Xaa Xaa 
230 235 

Xaa Xaa Xaa Xaa Xaa Xaa 

250 

Xaa Xaa Xaa Xaa Xaa Xaa 

265 



Xaa Xaa Xaa Xaa Xaa 

175 

Xaa Xaa Xaa Xaa Xaa 
190 

Xaa Xaa Xaa Arg Xaa 
205 

Xaa Xaa Xaa Xaa Xaa 
220 

Xaa Xaa Xaa Xaa Xaa 

240 

Xaa Xaa Xaa Xaa Xaa 

255 

Xaa Xaa Phe Xaa Xaa 
270 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

275 280 285 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

290 295 300 

Xaa Thr Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

305 310 315 



V 



20 



NSDOCID: <W0 . _. 996461 8A1. J _> 



WO 99/64618 

<210> 12 
<211> 34 
<212> DNA 

<213> Escherichia coli 



PCT/US99/11576 



<400> 12 

tagaattcag taaacaacga gtttttattg ctgg 



34 



<210> 13 
<211> 32 
<212> DNA 

<213> Escherichia coli 



<400> 13 

aactcgagtt acccccaaag cggtcttgat tc 



32 



<210> 14 
<211> 30 
<212> DNA 

<213> Escherichia coli 



<400> 14 

ctggagtcga attcatgagt aaacaacgag 



30 



<210> 15 
<211> 33 

<213> Escherichia coli 



<400> 15 

aactgcagtt acccccgaaa gcggtcttga ttc 



33 
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comprising culturing a microorganism having a genetic modification to increase the action of an enzyme selected from 
the group consisting of hexokinasc, glucose phosphate isomcrase etc. as well as a microorganism genetically modified 
for producing ascorbic acid. 

Group II, claims 60-70 and 72, drawn to a plant for producing ascorbic acid or esters thereof, wherein said plant has a 
genetic modification to increase the action of an enzyme selected from the group consisting of hexokinasc, glucose 
phosphate isomcrase etc. 

The inventions listed as Groups MI do not relate to a single inventive concept because tbey are considered to be two 
different categories of invention and arc not drawn to combination of categories (i.e. categories 1-5), specified in 37 CFR 
section 1.475(b). 
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